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Expression of Group B Neisseria meningitidis 
Outer Membrane (MB3) Protein from 
Yeast and Vaccines 



Background of the Invention 
Field of the Invention 

The present invention is in the field of recombinant genetics, protein 
expression, and vaccines. The present invention relates to a method of expressing 
in a recombinant yeast host an outer membrane group B porin protein from 
Neisseria meningitidis. The invention also relates to a vaccine comprising group 
A meningococcal polysaccharide (GAMP), group B meningococcal 
polysaccharide (GBMP) and group C meningococcal polysaccharide (GCMP) 
antigens, together with a pharmaceutical ly acceptable carrier. The invention also 
relates to a method of inducing an immune response in a mammal, comprising 
administering the above-mentioned vaccine to a mammal in an amount sufficient 
to induce an immune response. 

Background Information 

Meningococcal meningitis remains a worldwide problem, and occurs in 
both endemic and epidemic forms (Peltola, H., Rev. Infect. Dis 5:71-91 (1983); 
Gotschlich, E.C., "Meningococcal Meningitis," in Bacterial Vaccines, Germanier, 
E., ed., Academic, New York (1984), pp.237-255). Epidemic disease occurs in 



all parts of the world and incidences as high as 500 per 100,000 population have 
been reported. Without antibiotic treatment the mortality is extremely high 
(85%), and even with antibiotics, it remains at approximately 1 0%. In addition, 
patients cured by antibiotic therapy can still suffer serious and permanent 
neurologic deficiencies. These facts together with the emergence of sulfadiazine- 
resistant strains of Neisseria meningitidis promoted the rapid development of a 
commercial vaccine (Peltola, H., Rev. Infect. Dis. 5:71-91 (1983)). 

Neisseria meningitidis is a gram-negative organism that has been 
classified serologically into groups A, B. 29e, W135, X, Y. and Z (Gotschlich, 
E.C., "Meningococcal Meningitis," in Bacterial Vaccines, Germanier, E., ed.. 
Academic, New York (1984), pp. 237-255). Additional groups H, I, and K were 
isolated in China (Ding, S.-Q. et al, J. Biol Stand. 9:307-3 1 5 ( 1 98 1 )) and group 
L was isolated in Canada (Ashton, F.E. et ai, J. Clin. Microbiol. 7 7:722-727 
(1983)). The grouping system is based on the organisms' capsular 
polysaccharides. It was established (Lui, T.-Y. et al.< J Biol. Chem. 246.2M9- 
2858 (1971)) that the group A polysaccharide is a partially O-acetylated (1-6) 
linked homopolymer of 2-acetamido-2-detoxy-D-mannopyranosy 1 phosphate, and 
that both groups B and C polysaccharides are homopolymers of sialic acid. 

N. meningitidis groups A, B, and C are responsible for approximately 
90% of cases of meningococcal meningitis. Success in the prevention of group 
A and C meningococcal meningitis was achieved using a bivalent polysaccharide 
vaccine (Gotschlich, E.C. et al., J. Exp. Med. 1 29: 1367- 1384 (1969); Artenstein. 
M.S. et al. t N. Engl. J. Med. 252:417-420 (1970)); this vaccine became a 
commercial product and has been used successfully in the last decade in the 
prevention and arrest of major meningitis epidemics in many parts of the world. 
However, there has been a need to augment this vaccine because a significant 
proportion of cases of meningococcal meningitis are due to groups other than A 
and C. Group B is of particular epidemiologic importance, but groups Y and 
W135 are also significant (Cadoz, M. et al., Vaccine 3:340-342 (1985)). The 
inclusion of the group B polysaccharide in the vaccine has been a special problem 
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(see below); however, a tetravalent vaccine comprising groups A, C, Wl 35, and 
Y has proven to be safe and immunogenic in humans (Cadoz, M. et al. Vaccine 
5:340-342 (1985)) and is the currently used meningococcal meningitis vaccine 
(Jennings, H J., "Capsular Polysaccharides as Vaccine Candidates," in Current 
Topics in Microbiol and Immunol, Jann, D. and Jann, B., eds, Springer- Verlag, 
Berlin (1990) Vol 150:97-127). 

The outer membranes of Neisseria species much like other Gram negative 
bacteria are semi -permeable membranes which allow free flow access and escape 
of small molecular weight substances to and from the periplasmic space of these 
bacteria but retard molecules of larger size (Heasiey, F.A., et al, "Reconstitution 
and characterization of the N gonorrhoeae outer membrane permeability barrier," 
in Genetics and Immunobiology of Neisseria gonorrhoeae, Danielsson and 
Normark, eds., University of Umea, Umea, pp. 1 2- 1 5 ( 1 980); Douglas, J.T., et al , 
FEMS Microbiol Lett. 72:305-309(1981)). One of the mechanisms whereby this 
is accomplished is the inclusion within these membranes of proteins which have 
been collectively named porins. These proteins are made up of three identical 
polypeptide chains (Jones, R.B., et al. Infect. Immun. 30:773-780 (1980): 
McDade, Jr. and Johnston, ./ Bacteriol 7-/7:l 1 83-1 1 91 (1980)) and in their 
native trimer conformation, form water filled, voltage-dependent channels within 
the outer membrane of the bacteria or other membranes to which they have been 
introduced (Lynch, E.G., et al, Biophys. J. 41:62 (1983); Lynch, E.C., et al, 
Biophys. J. 45:104-107 (1984); Young, J.D.E., et al, Proc. Natl Acad Sci. USA 
50:3831-3835 (1983); Mauro, A., etal, Proc. Natl Acad. Sci. USA #5:1071-1075 
(1988); Young, J.D., et al, Proc. Natl Acad. Sci. USA #5:150-154 (1986)). 
Because of the relative abundance of these proteins within the outer membrane, 
these protein antigens have also been used to subgroup both Neisseria 
gonorrhoeae and Neisseria meningitidis into several serotypes for 
epidemiological purposes (Frasch, C.E., etal. Rev. Infect. Dis. 7:504-510 (1985); 
Knapp, J.S., et al, "Overview of epidemiological and clinical applications of 
auxotype/serovar classification of Neisseria gonorrhoeae:' The Pathogenic 



Neisserias Schoolnik, G.K., ed., American Society for Microbiology, 
Washington, pp. 6-12 (1985)). To date, many of these proteins from both 
gonococci and meningococci have been purified (Heckels, J.E., J. Gen. 
Microbiol 99:333-341 (1977); James and Heckels, J. Immunol Math. 42:223-228 
(1981); Judd, R.C., Anal Biochem. 7 75:307-316 (1988); Blake and Gotschlich. 
Infect. Immun. 36:277-283 (1982): Wetzler, L.M., et al y J. Exp. Med 765:1883- 
1897 (1988)), and cloned and sequenced (Gotschlich, E.C., et al, Proc. Natl. 
Acad. ScL USA 54:8135-8139 (1987); McGuinness, B., et al, J. Exp. Med 
777:1871-1882 (1990); Carbonetti and Sparling, Proc. Natl Acad Sci. USA 
54:9084-9088 (1987); Feavers, I.M., etal, Infect. Immun. 60:3620-3629 ( 1 992); 
Murakami, K., et al. f Infect. Immun. 57:2318-2323 (1989): Wolff and Stern, 
FEMS Microbiol Lett. 53:179-186 (1991 ); Ward, M.J., et al n FEMS Microbiol 
Lett. 73:283-289(1992)). 

The porin proteins were initially co-isolated with lipopolysaccharides 
(LPS). Consequently, the porin proteins have been termed "endotoxin-associated 
proteins" (Bjornson etal, Infect. Immun. 56:1602-1607 (1988)). Studies on the 
wild type porins have reported that full assembly and oligomerization are not 
achieved unless LPS from the corresponding bacterial strain is present in the 
protein environment (Holzenburg et al, Biochemistry 25:41 87-4193 (1989): Sen 
and Nikaido, J. Biol Chem. 266:1 1295-1 1300 (1991)). 

The meningococcal porins have been subdivided into three major 
classifications which in antedated nomenclature were known as Class 1 , 2, and 
3 (Frasch, C.E., et al, Rev. Infect. Dis. 7:504-510 (1985)). Each meningococcus 
examined has contained one of the alleles for either a Class 2 porin gene or a 
Class 3 porin gene but not both (Feavers, I.M, et al, Infect. Immun. 6(9:3620- 
3629 (1992)); Murakami, K., et al, Infect. Immun. 57:2318-2323 (1989)). The 
presence or absence of the Class 1 gene appears to be optional. Likewise, all 
probed gonococci contain only one porin gene with similarities to either the Class 
2 or Class 3 allele (Gotschlich, E.C.. et al, Proc. Natl Acad. Sci. USA 54:81 35- 
8139 (1987); Carbonetti and Sparling, Proc. Natl Acad. Sci. USA 54:9084-9088 
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(1987)). N. gonorrhoeae appear to completely lack the Class 1 allele. The data 
from the genes that have been thus far sequenced would suggest that all neisserial 
porin proteins have at least 70% homology with each other with some variations 
on a basic theme (Feavers, I.M., et al, Infect, hnmun. (50:3620-3629 (1992)). It 
has been suggested that much of the variation seen between these neisserial porin 
proteins is due to the immunological pressures brought about by the invasion of 
these pathogenic organisms into their natural host, man. However, very little is 
known about how the changes in the porin protein sequence effect the functional 
activity of these proteins. 

It has been previously reported that isolated gonococcal porins of the 
Class 2 allelic type behave electrophysically somewhat differently than isolated 
gonococcal porins of the Class 3 type in lipid bilayer studies both in regards to 
their ion selectivity and voltage-dependence (Lynch, EC, et al, Biophys. J. 
41:62 (1983); Lynch, E.C., etai, Biophys. J. 45: 1 04- 107 (1984)). Furthermore, 
the ability of the different porins to enter these lipid bi layers from intact living 
bacteria seems to correlate not only with the porin type but also with the 
neisserial species from which they were donated (Lynch, E.C., et al, Biophys. J. 
45: 104- 107 (1984)). It would seem that at least some of these functional 
attributes could be related to different areas within the protein sequence of the 
porin. One such functional area, previously identified within all gonococcal 
Class 2-like proteins, is the site of chymotrypsin cleavage. Upon chymotrypsin 
digestion, this class of porins lack the ability to respond to a voltage potential and 
close. Gonococcal Class 3-like porins as well as meningococcal porins lack this 
sequence and are thus not subject to chymotrypsin cleavage but nonetheless 
respond by closing to an applied voltage potential (Greco, F.. "The formation of 
channels in lipid bilayers by gonococcal major outer membrane protein," thesis. 
The Rockefeller University, New York (1981); Greco, F.. et ai. Feci Proc. 
JP.1813 (1980)). 

As the Neisseria porins are among the most abundant proteins present in 
the outer membrane of these organisms, and as they do not undergo antigenic 



shift during infection (unlike several other major surface antigens), their universal 
presence in both Neisseria meningitidis and Neisseria gonorrhoea, as well as 
their exposure at the surface, make them candidates for components of vaccines 
against these organisms. Patients convalescing from meningococcal disease 
produce anti-porin antibodies, and antibodies elicited by immunization with porin 
proteins are bactericidal to homologous serotypes. Furthermore, within a 
particular epidemiologic setting, most strains causing meningococcal disease 
belong to a limited number of serotypes, notably serotype 2 among strains with 
a class 2 protein and serotype 1 5 among strains with class 3 proteins. Therefore, 
class 2 and 3 proteins are attractive candidates for vaccines. 

The major impediment for such studies has been the ability to easily 
manipulate the porin genes by modern molecular techniques and obtain sufficient 
purified protein to carry out the biophysical characterizations of these altered 
porin proteins- It was early recognized that cloned neisseria! porin genes, when 
expressed in Escherichia coli, were lethal to the host E. coli (Carbonetti and 
Sparling, Proc. Natl. Acad ScL USA 5-/:9084-9088 (1987); Carbonetti, N.H., 
et al, Proc. Nad. Acad ScL USA 55:6841-6845 (1988); Barlow, A.K., et al. 
Infect. Immun. 55:2734-2740 (1987)). Thus, many of these genes were cloned and 
sequenced as pieces of the whole gene or placed into low copy number plasmids 
under tight expression control (Carbonetti, N.H., et al. % Proc. Natl. Acad. Sci. 
USA 55:6841-6845 (1988)). Under these conditions, even when the entire porin 
gene was expressed, very little protein accumulated that could be further purified 
and processed for characterization. 

Another tack to this problem which has met with a modicum of success 
has been to clone the porin genes into a low copy, tightly controlled expression 
plasmid, introduce modifications to the porin gene, and then reintroduce the 
modified sequence back into Neisseria (Carbonetti, N.H., et ah . Proc. Nail. Acad. 
Sci. USA 55:6841-6845 (1988)). However, this has also been fraught with 
problems due to the elaborate restriction endonuclease system present in 
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Neisseria, especially gonococci (Davies, J.K., Clin. Microbiol. Rev. 2:S78-S82 
(1989)). 

While a vaccine comprising neisserial porin has long been sought, an 
effective meningococcal polysaccharide vaccine which would give complete 
coverage to all serogroup organisms and to all humans is also needed. Several 
serious problems remain in the development of such a broad range polysaccharide 
vaccine. First, it has been established that although the group A and C 
polysaccharides are efficacious in adults and older children, their effectiveness 
in infants has only been marginal (Goldschneider, I., et aL, J. Infect. Dis. 
725:769-776 (1973); Gotschlich, E.C., et aL, "The Immune Responses to 
Bacterial Polysaccharides in Man," In: Antibodies in Human Diagnosis and 
Therapy, Haber, E. and Krause, R.M., eds.. Raven, New York (1977), pp. 391- 
402). Second, the group B meningococcal polysaccharide is only poorly 
immunogenic in man (Wyle, F.A., et aL, J. Infect. Dis. 726:514-521 (1972)). A 
third problem is the tendency for multivalent vaccines to be less immunogenic 
than each component would be if administered individually (Insel, R.A., 
"Potential alterations in immunogenicity by combining or simultaneously 
administering vaccine components," In: Annals of the New York Academy of 
Sciences, Vol. 754. Combined Vaccines and Simultaneous Administration: 
Current Issues and Perspectives, Williams. J.C., et aL. eds, New York Academy 
of Sciences, New York (1993), pp. 35-47; Clemens, J., et aL, "Interactions 
between PRP-T vaccine against Haemophilus influenzae type b and conventional 
infant vaccines: lessons for future studies of simultaneous immunization and 
combined vaccines," In: Annals of the New York Academy of Sciences, Vol. 754. 
Combined Vaccines and Simultaneous Administration: Current Issues and 
Perspectives, Williams, J.C., el aL, eds, New York Academy of Sciences, New 
York (1993), pp. 255-266; Paradiso, P.R., et aL, Pediatrics 92^:827-832 
(1993)). 

Presently available vaccines against group A and C N. meningitidis are 
poorly immunogenic in human infants (age two and under) because, in contrast 



to the immunity generated by most antigens, a polysaccharide-specific immune 
response in infants is T-cell-independent. In the absence of T-cell involvement, 
an immune response is of short duration. More importantly, no memory is 
demonstrable, so no "booster" reactions occur. Furthermore, antibody affinity 
maturation does not occur. These deficiencies are due to the inability of 
polysaccharides to specifically bind to T-cells. Presumably, the structural 
features required for association with a T-cell receptor do not exist in the majority 
of polysaccharides. Because of the T-cell independent nature of the immune 
response, the antibody response to a polysaccharide in infants is limited to 
antibodies of the IgM isotype; the isotype switching necessary for production of 
non-IgM antibodies requires T-cell involvement. Polysaccharide antigens present 
less of a problem in more mature humans (over age two), as they are able to 
induce antibodies of the IgG isotype as well as IgM (Yount el a/., J. Exp. Med. 
727:633-646 (1968)). 

The group B meningococcal polysaccharide is even less immunogenic in 
humans of all ages than other polysaccharides. Two major explanations have 
been proposed to account for this characteristic (Jennings, H. J., Adv. Carhohydr. 
Chem. Biochem. 47:155-208 (1983); Lifely, M.R. et at., Vaccine 5:11-26 (1987)). 
One is that the group B meningococcal polysaccharide, an a-(2— ►8)-linked sialic 
acid homopolymer, is rapidly depolymerized in human tissue because of the 
action of neuraminidase. The other is that the structure is recognized as "self by 
the human immune system and in consequence, the production of antibody 
specific for this structure is suppressed. The weight of evidence is in favor of the 
latter explanation because a neuraminidase-sensitive variant of the group C 
meningococcal polysaccharide [an a-(2— ►9)-linked sialic acid homopolymer] still 
proved to be highly immunogenic in man (Glode, M.P. et aL ./. Infect. Dis. 
J 39:52-59 (1979)). In addition it was demonstrated that conjugation of the group 
B polysaccharide to a protein carrier (tetanus toxoid) through its terminal 
nonreducing sialic acid, which stabilizes the polysaccharide to neuraminidase, did 
not result in any significant enhancement in its immunogenicity (Jennings, H.J. 
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and Lugowski, C, J. Immunol 727:101 1-1018 (1981)). The above observations 
are consistent with a theory that the immune mechanism avoids the production 
of antibody having a specificity for the a-(2-+8)-linked sialic acid residues. This 
theory was farther confirmed by the identification of a-(2->8)-linked sialic acid 
residues in the oligosaccharides of human and animal tissue. A novel approach 
to overcoming the poor immunogenicity of the group B polysaccharide has been 
to modify it chemically. 

The T-cell independent quality of polysaccharide antigens in infant 
humans can be overcome by conjugating (covalently coupling) the polysaccharide 
to a protein carrier. The capsular polysaccharides of the bacteria primarily 
responsible for postneonatal meningitis have been conjugated to protein carriers; 
these include type b H. influenzae (Schneerson, R. et aL J. Exp. Med. 752:361- 
376 (1980); Anderson, P.W., Infect. Immun, 59:233-238 (1983); Marburg, S. et 
aL, J. Am. Chem. Soc. 705:5282-5287 (1986)), group A (Jennings, H.J. and 
Lugowski, C, J. Immunol. 727:101 1-1018 (1981)) ; Beuvery, E.C. et a/., Vaccine 
7:31-36 (1983)), B (Jennings, H.J. and Lugowski, C, J. Immunol. 727:101 1-1018 
(1981)), and C (Jennings, H.J. and Lugowski, C, J. Immunol. 727:101 1-101 8 
(1981)); Beuvery, E.C. et aL Infect. Immun. ^0:39-45 (1983)) TV. meningitidis. 
and type 6A Strep, pneumoniae (Chu, C. et aL, Infect. Immun. ^0:245-256 
(1983)). For the choice of carrier protein most investigators have used tetanus 
toxoid or diphtheria toxoid, two proteins currently used as infant vaccines. A 
recent innovation on this theme has been the use of a mutant-derived diphtheria 
toxin (CRM l97 ) (Anderson, P.W., Infect. Immun. JP:233-238 (1983)) which is 
nontoxic. The significance of this protein is that because it does not require 
detoxifying by treatment with formaldehyde, all its amino groups remain 
underivatized, which greatly facilitates the conjugation process. 

The use of other potential bacterial proteins as carriers has not been 
extensively explored. However, a serotype outer member protein of A'. 
meningitidis has been used as a protein carrier (Marburg, S. et aL, J. Am. Chem. 
Soc. 70^:5282-5287 (1986)). 



- 10- 



In light of the foregoing, it will be clear that there is a significant need for 
a process by which large quantities of the outer membrane group B porin proteins 
of N. meningitidis can be obtained. It will also be clear that a need exists for a 
polysaccharide vaccine which would give complete coverage to the three major 
serogroups of TV. meningitidis, groups A, B and C, and which would provide 
immunity against these organisms to both infants and more mature humans. 

Summary of the Invention 

It is a general object of the invention to provide a method of expressing 
in yeast the meningococcal group B porin proteins, in particular, the class 3 porin 
protein. 

It is a specific object of the invention to provide a method of expressing 
the outer membrane meningococcal group B porin protein or a fusion protein 
thereof in yeast, comprising: 

(a) cloning into a plasmid having a selectable marker a gene coding 
for a protein selected from the group consisting of: 

(i) a mature porin protein 

(ii) a fusion protein which is a mature porin protein fused 
to a yeast secretion signal peptide; 

wherein said gene is operably linked to a yeast promoter; 

(b) transforming said plasmid containing said gene into a yeast strain; 

(c) selecting the transformed yeast by growing said yeast in a culture 
medium allowing selection of said transformed yeast; 

(d) growing the transformed yeast, and 

(e) inducing expression of said protein to give yeast containing 
said protein; 

wherein the protein so expressed comprises more than about 2% of the total 
protein expressed in said yeast. 
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lt is another specific object of the invention to provide a method of 
expressing a mature porin protein or fusion protein thereof wherein the protein 
so expressed comprises about 3-5% of the total protein expressed in yeast. 

It is yet another specific object of the invention to provide a method of 
expressing a mature porin protein wherein the protein is the Neisseria 
meningitidis outer membrane meningococcal group B porin protein (MB3). 

It is another specific object of the invention to provide a method of 
expressing a mature porin protein or fusion protein thereof wherein the yeast 
promoter is the AOX1 promoter. 

It is another specific object of the invention to provide a method of 
expressing the outer membrane meningococcal group B porin protein or a fusion 
protein thereof in yeast, wherein the yeast secretion signal peptide is selected 
from the group consisting of the secretion signal of the S. cerevisiae a-mating 
factor prepro-peptide and the secretion signal of the P. pastoris acid phosphatase 
gene (PHO). 

It is yet another specific object of the invention to provide a method of 
expressing MB3 or a fusion protein thereof in yeast as described above, wherein 
the plasmid is selected from the group consisting of pHIL-D2, pHIL-Sl, pPIC9 
and pPIC9K. 

It is a further specific object of the invention to provide a method of 
expressing the above-described meningococcal group B porin protein or fusion 
protein wherein at least one codon of the 5 ' region of the gene encoding said 
protein has been changed so as to be optimized for yeast codon usage. 

It is still a further specific object of the invention to provide a method of 
expressing the above-described meningococcal group B porin protein or fusion 
protein wherein the 5' region of the gene encoding said protein comprises a 
nucleotide sequence that incorporates codons optimized for P. pastoris codon 
usage. 

It is another specific object of the invention to provide a method as 
described above wherein the codon changes are selected from the group of 
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changes consisting of: (GTT to GTC at positions 4-6 of the native sequence), 
(ACC to ACT at positions 7-9 of the native sequence), (CTG to TTG at positions 
10-12 of the native sequence), (GGC to GGT at positions 16-18 of the native 
sequence), (ACC to ACT at positions 19-21 of the native sequence), (ATC to 
ATT at positions 22-24 of the native sequence), (AAA to AAG at positions 25-27 
of the native sequence), (GCC to GCT at positions 28-30 of the native sequence), 
(GGC to GGT at positions 31-33 of the native sequence), (GTA to GTT at 
positions 34-36 of the native sequence), (GAA to GAG at positions 37-39 of the 
native sequence); wherein said positions are numbered from the first nucleotide 
of the native nucleotide sequence encoding said protein. 

It is another specific object of the invention to provide a method as 
described above wherein the 5' region of the gene includes codons optimized for 
P. pastoris codon usage, and wherein the nucleotide sequence is SEQ ID NO: 26. 

It is another specific object of the invention to provide a method of 
expressing the above-mentioned protein wherein the yeast secretes the protein or 
fusion protein. 

It is another specific object of the invention to provide a method of 
expressing the above-mentioned protein wherein the vector from which the 
secreted protein is expressed is selected from the group consisting of pHIL-Sl, 
pPIC9, and pPIC9K. 

It is another specific object of the invention to provide a method of 
purifying insoluble, intracellular outer membrane meningococcal group B porin 
protein or fusion protein thereof obtained according to the invention comprising: 

(a) lysing the yeast described above which has expressed the 
protein to release said protein as an insoluble membrane bound 
fraction; 

(b) washing the insoluble material obtained in step (a) with 
buffers to remove contaminating yeast cellular proteins; 

(c) suspending and dissolving said insoluble fraction obtained 
in step (b) in aqueous solution of denaturant; 
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(d) diluting the solution obtained in step (c) with a detergent; 
and 

(e) purifying said protein by gel filtration and ion exchange 
chromatography . 

It is another specific object of the invention to provide a method of 
purifying the outer membrane meningococcal group B porin protein or fusion 
protein thereof obtained according to the invention comprising: 

(a) centrifuging the yeast culture described above which has 
expressed the protein to isolate the protein as soluble secreted 
material; 

(b) removing contaminating yeast culture impurities from 
the soluble secreted material obtained in step (a) by precipitating 
said impurities with about 20% cthanol, wherein the soluble 
secreted material remains in the soluble fraction; 

(c) precipitating the secreted material from the soluble fraction 
of step (b) with about 80% ethanol; 

(d) washing the precipitated material obtained in step (c) with 
a buffer to remove contaminating yeast secreted proteins; 

(e) suspending and dissolving the precipitated material 
obtained in step (d) in an aqueous solution of detergent; and 

(f) purifying the protein by ion exchange chromatography. 
It is a further specific object of the invention to provide a yeast host cell 

that contains a gene coding for a protein selected from the group consisting of: 

(a) a mature porin protein 

(b) a fusion protein which is a mature porin protein fused to a 
yeast secretion signal peptide; 

wherein said gene is operably linked to a yeast promoter. 

It is still another specific object of the invention to provide a yeast host 
cell as described above which is capable of expressing the Neisseria meningitidis 
mature outer membrane class 3 protein of serogroup B (MB3). 
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It is still another specific object of the invention to provide a yeast host 
cell as described above wherein the yeast promoter is the AOX1 promoter. 

It is another object of the invention to provide a vaccine comprising group 
A meningococcal polysaccharide (GAMP), group B meningococcal poly- 
saccharide (GBMP), and group C meningococcal polysaccharide (GCMP) 
antigens, together with a pharmaceutical ly acceptable carrier. 

It is still another specific object of the invention to provide a method of 
inducing an immune response in a mammal, comprising administering to a 
mammal the above-described vaccine in an amount sufficient to induce an 
immune response in a mammal. 

Further objects and advantages of the present invention will be clear from 
the description that follows. 

Brief Description of the Drawings 

Figure 1 : A diagram showing the sequencing strategy of the PorB gene. 
The PCR product described in Example 1 (Materials and Methods section) was 
ligated into the BamY{\-Xho\ site of the expression plasmid pET-1 7b. The initial 
double stranded primer extension sequencing was accomplished using 
oligonucleotide sequences directly upstream of the BamUl site and just 
downstream of the Xhol site within the pET-1 7b plasmid. Additional sequence 
data was obtained by making numerous deletions in the 3' end of the gene, using 
exonuclease III/mung bean nuclease reactions. After religation and 
transformation back into E. coli, several clones were selected on size of insert and 
subsequently sequenced. This sequencing was always from the 3' end of the gene 
using an oligonucleotide primer just downstream of the Bpul 1021 site. 

Figure 2: A gel electrophoresis showing the products of the PCR reaction 
(electrophoresed in a 1% agarose using TAE buffer). 

Figures 3A and 3B. Fig. 3A: SDS-PAGE analysis of whole cell lysates 
of E. coli hosting the control pET-17b plasmid without inserts and an E. coli 
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clone harboring pET-17b plasmid containing an insert from the obtained PCR 
product described in the materials and methods section. Both cultures were 
grown to an O.D. of 0.6 at 600 ran, IPTG added, and incubated at 37 °C for 2 hrs. 
L5 mis of each of the cultures were removed, centrifuged, and the bacterial pellet 
solubilized in 100 jal of SDS-PAGE preparation buffer. Lane A shows the 
protein profile obtained with 10 jul from the control sample and Lanes B (5 
and C (10 jil) demonstrate the protein profile of the E. coli host expressing the 
PorB protein. Fig. 3B: Western blot analysis of whole cell lysates of E. coli 
harboring the control pET-17b plasmid without insert after 2 hrs induction with 
IPTG, Lane A, 20 \i\ and a corresponding E. coli clone containing a porB-pET- 
17b plasmid, Lane B, 5 jil; Lane C. 10 \x\\ and Lane D, 20 The monoclonal 
antibody 4D1 1 was used as the primary antibody and the western blot developed 
as described. The pre-stained low molecular weight standards from BRL were 
used in each case. 

Figure 4: The nucleotide sequence and the translated amino acid sequence 
of the mature PorB gene cloned into the expression plasmid pET-1 7b. The two 
nucleotides which differ from the previously published serotype 15 PorB are 
underlined. 

Figure 5: A graph showing the Sephacryl S-300 column elution profile 
of both the wild type Class 3 protein isolated from the meningococcal strain 8765 
and the recombinant Class 3 protein produced by BL21(DE3) -AompA E. coli 
strain hosting the r3pET-17b plasmid as monitored by absorption at 280nm and 
SDS-PAGE analysis. The void volume of the column is indicated by the arrow. 
Fractions containing the meningococcal porin and recombinant porin as 
determined by SDS-PAGE are noted by the bar. 

Figure 6: A graph showing the results of the inhibition EL1SA assays 
showing the ability of the homologous wild type (wt) PorB to compete for 
reactive antibodies in six human immune sera. The arithmetic mean inhibition 
is shown by the bold line. 
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Figure 7: A graph showing the results of the inhibition ELISA assays 
showing the ability of the purified recombinant PorB protein to compete for 
reactive antibodies in six human immune sera. The arithmetic mean inhibition 
is shown by the bold line. 
5 Figure 8: A graph showing a comparison of these two mean inhibitions 

obtained with the wt and recombinant PorB protein. 

Figure 9A and 9B: The nucleotide sequence and the translated amino acid 
sequence of the mature class II porin gene cloned into the expression plasmid 
pET-17b. 

10 Figure 10A and 10B: The nucleotide sequence and the translated amino 

acid sequence of the fusion class II porin gene cloned into the expression plasmid 
pET-17b. 

Figure 1 1 (panels A and B): Panel A depicts the restriction map of the 
pET-1 7b plasmid. Panel B depicts the nucleotide sequence between the BglU and 

15 Xhol sites of pET-17b. The sequence provided by the plasmid is in normal print 

while the sequence inserted from the PCR product are identified in bold print. 
The amino acids which are derived from the plasmid are in normal print while the 
amino acids from the insert are in bold. The arrows demarcate where the 
sequence begins to match the sequence in Figure 4 and when it ends. 

20 Figure 12: A graph showing the level of expression of MB 3 for clone pnv 

322, where the expression vector used is pHIL-D2. The level of MB3 expressed 
is depicted as mg of insoluble MB3 per gram of cell pellet per unit time. 

Figure 13 A: The DNA sequence and translated amino acid sequence of 
pNV15 (MB3 in pET24a) before codon preference optimization. 

25 Figure 13B: The DNA sequence and translated amino acid sequence of 

Men.Class3 opt. (MB3 optimized for yeast codon preference). 

Figures 14A and 14B: Graphs showing the elution of MB3 from a size 
exclusion column. MB3 expressed in an intracellular form was purified by a 
denaturation/renaturation protocol, followed by gel filtration and ion exchange 

30 chromatography. The resultant purified protein exhibited by size exclusion 
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chromatography an elution profile which resembles the recombinant class 3 
protein overexpressed in £. coli, and both give the same elution profile as the 
native wild-type counterpart. This indicates that MB3 refolds and oligomerizes, 
achieving full native conformation. 14(A): the elution profile of MB3; 14(B): the 
elution profile of class 3 protein expressed and refolded from E. coli inclusion 
bodies. 

Figure 15: A graph showing the size exclusion chromatography of 
purified MB3 on a Superose 12 (Pharmacia) column connected to an HPLC 
(Hewlett Packard model 1090). Based on the comparison of MB3 with the native 
wild-type counterpart, as well as calibration using molecular weight standards 
(designated as arrows), the elution profile is indicative of trimeric assembly. 
Molecular weight markers are: 1 = thyroglobulin (670,000); 2 = gammaglobulin 
(158,000); 3 = myoglobin (17,000). 

Figures 16A, 16B and 16C: The DNA sequence of clone pnv 322. This 
clone has the MB3 gene inserted into the EcoRl site of the Invitrogen expression 
vector pHIL-D2. MB3 is thus inserted directly downstream from the A OX I 
promoter. This construct allows intracellular expression. Vector sequences are 
shown in upper case letters, while the MB3 sequence is given in lower case 
letters. 

Figures 17 A, 17B and 17C: The DNA sequence of clone pnv 318. This 
clone has the MB 3 gene inserted into the XhoI-BamHI sites of the Invitrogen 
expression vector pHIL-Sl . MB3 is thus inserted directly downstream from the 
PHOl leader peptide, in frame with the secretion signal open reading frame for 
secretion of expressed protein. Vector sequences are shown in upper case letters, 
while the MB3 sequence is given in lower case letters. 

Figures 18A, 18B and 18C: The DNA sequence of clone pnv 342. This 
clone has the MB3 gene inserted into the EcoRl-Avrll sites of the Invitrogen 
expression vector pPIC-9. MB 3 is thus inserted directly downstream from the 
secretion signal of a-factor prepro peptide, for secretion of expressed protein. 
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VecJtor sequences are shown in upper case letters, while the MB3 sequence is 
given in lower case letters. 

Figures 19A, 19B and 19C: The DNA sequence of clone pnv 350. This 
clone has the MB3 gene inserted into the EcoRA-Avrll sites of the Invitrogen 
expression vector pPIC-9K. MB3 is thus inserted directly downstream from the 
secretion signal of a-factor prepro peptide, for secretion of expressed protein. 
Vector sequences are shown in upper case letters, while the MB3 sequence is 
given in lower case letters. 

Figure 20: A graph showing the absorbance spectra (electropherogram) 
of GAMP, TT-monomer, and GAMP-TT conjugate. 

Figure 21 : A graph showing the absorbance spectra (electropherogram) 
of GCMP, TT-monomer, and GCMP-TT conjugate. 

Figure 22: A graph showing the A-specific IgG ELISA titer elicited by 
monovalent (A) and trivalent (A/B/C) meningococcal conjugate vaccines in mice. 

Figure 23: A graph showing the B-specific IgG ELISA titer elicited by 
monovalent (A) and trivalent (A/B/C) meningococcal conjugate vaccines in mice. 

Figure 24: A graph showing the C-specific IgG ELISA titer elicited by 
monovalent (C) and trivalent (A/B/C) meningococcal conjugate vaccines in mice. 

Figure 25: A graph showing the A-specific bacteriocidal activity elicited 
by monovalent (A) and trivalent (A/B/C) meningococcal conjugate vaccines in 
mice. 

Figure 26: A graph showing the B-specific bacteriocidal activity elicited 
by monovalent (A) and trivalent (A/B/C) meningococcal conjugate vaccines in 
mice. 

Figure 27: A graph showing the C-specific bacteriocidal activity elicited 
by monovalent (A) and trivalent (A/B/C) meningococcal conjugate vaccines in 
mice. 
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Detailed Description of the Invention 



It is possible to overcome some of the difficulties involved in expressing 
and purifying the outer membrane group B porin proteins of N. meningitidis from 
E. coli. The DNA sequences of the mature porin proteins, e.g. class 2 and class 
3 as well as fusions thereof, were amplified using the chromosome of the 
meningococcal bacteria as a template for the PCR reaction. The amplified porin 
sequences were ligated and cloned into an expression vector containing the T7 
promoter. E. coli strain BL21 lysogenic for the DE3 lambda phage (Studier and 
Moffatt, 1 Mol Biol 189: 113-130 (1 986)), modified to eliminate the ompA gene, 
was selected as one expression host for the pET-1 7b plasmid containing the porin 
gene. Upon induction, large amounts of the meningococcal porin proteins 
accumulated within E. coli without any obvious lethal effects to the host 
bacterium. The expressed meningococcal porin proteins were extracted and 
processed through standard procedures and finally purified by molecular sieve 
chromatography and ion exchange chromatography. As judged by the protein 
profile from the molecular sieve chromatography, the recombinant 
meningococcal porins eluted from the column as trimers. To be certain that no 
PCR artifacts had been introduced into the meningococcal porin genes to allow 
for such high expression, the inserted PorB gene sequence was determined. 
Inhibition ELISA assays were used to give further evidence that the expressed 
recombinant porin proteins had renatured into their natural antigenic and trimer 
conformation. 

As an alternative to the bacterial E. coli host system. Meningococcal B 
Class 3 porin protein (MB3) may be expressed in yeast. A preferred host is the 
methylotrophic yeast Pichia pastoris, which may be transformed with the Pichia 
Expression Kit developed by Invitrogen. Yeasts are attractive hosts for the 
production of heterologous proteins. Unlike prokaryotic systems, their eukaryotic 
subcellular organization enables them to carry out many of the post-translational 
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folding, processing and modification events required to produce "authentic' 1 and 
bioactive proteins. As a eukaryote, Pichia pastoris has many of the advantages 
of a higher eukaryotic expression system, while being as easy to manipulate as 
E. coli or Saccharomyces cerevisiae. As a yeast, it shares the advantages of 
5 molecular and genetic manipulations with Saccharomyces, and it has the added 

advantages of 10- to 100-fold higher heterologous protein expression levels and 
the protein processing characteristics of higher eukaryotes. 

Expression in Pichia also provides advantages compared to expression in 
other yeast strains because Pichia does not tend to hyperglycosylate proteins as 

10 does S. cerevisiae. Further, proteins expressed and modified in Pichia may be 

more useful therapeutically than those produced by S. cerevisiae, as 
oligosaccharides added by Pichia lack the a\3 glycan linkages which are 
believed to be primarily responsible for the hyper-antigenic nature of proteins 
produced by S. cerevisiae. Several vaccine antigens have been produced in yeast 

15 cells, including hepatitis B surface antigen which is in clinical use (Cregg et al, 

Bio/Technology 77:905-910 (1993)). 

Unlike the porin proteins of E. coli and a few other gram negative 
bacteria, relatively little is known about how changes in the primary sequence of 
porins from Neisseria effect their ion selectivity, voltage dependence, and other 

20 biophysical functions. Recently, the crystalline structure of two E. coli porins, 

OmpF and PhoE, were solved to 2.4A and 3.0A, respectively (Cowan, S.W., 
et al, Nature 358:727-733 (1992)). Both of these E. coli porins have been 
intensively studied owing to their unusual stability and ease with which molecular 
genetic manipulations could be accomplished. The data obtained for the genetics 

25 of these two porins correlated well with the crystalline structure. Although it has 

been shown in several studies using monoclonal antibodies to select neisserial 
porins that the surface topology of Neisseria closely resembles that of these two 
E. coli porins (van der Ley, P., et al, Infect. Immun. 59:2963-2971 (1991)), 
almost no information is available about how changes in amino acid sequences 

30 in specific areas of the neisserial porins effect their biophysical characteristics, 



WO 97/28273 



PCT/US97/01687 



-21 - 

as is known about the E. coli porins (Cowan, S.W., et al, Nature 358:121-133 
(1992)). 

Two of the major problems impeding this research are: (1) the inability 
to easily manipulate Neisseria genetically by modern molecular techniques and 
(2) the inability to express sufficient quantities of neisserial porins in E. coli or 
yeast for further purification to obtain biophysical and biochemical 
characterization data. In fact, most of the DNA sequence data on gonococcal and 
meningococcal porins have been obtained by cloning overlapping pieces of the 
porin gene and then reconstructing the information to reveal the entire gene 
sequence (Gotschlich, E.C, et al, Proc. Natl. Acad. Sci. USA #4:8135-8139 
(1987); Murakami, K., et al, Infect. Immun. 57:2318-2323 (1989)). Carbonetti 
et al were the first to clone an entire gonococcal porin gene into E. coli using a 
tightly controlled pT7-5 expression plasmid. The results of these studies showed 
that when the porin gene was induced, very little porin protein accumulated and 
the expression of this protein was lethal to the E. coli (Carbonetti and Sparling, 
Proc. Natl Acad. Sci. USA #4:9084-9088 (1987)). In additional studies, 
Carbonetti et al. {Proc. Natl Acad. Sci. USA #5:684 1 -6845 ( 1 988)) did show that 
alterations in the gonococcal porin gene could be made in this system in E. coli 
and then reintroduced into gonococci. However, the ease with which one can 
make these manipulations and obtain enough porin protein for further 
biochemical and biophysical characterization seems limited. 

Feavers et al. have described a method to amplify, by PCR, neisserial 
porin genes from a wide variety of sources using two synthesized 
oligonucleotides to common domains at the 5' and 3' ends of the porin genes 
respectively (Feavers, I.M., et al, Infect. Immun. 60:3620-3629 (1992)). The 
oligonucleotides were constructed such that the amplified DNA could be forced 
cloned into plasmids using the restriction endonucleases BglW and Xho\. 

Using the Feavers et al PCR system, the DNA sequence of the mature 
PorB protein from meningococcal strain 8765 serotype 15 was amplified and 
ligated into the BamUl-Xhol site of the T7 expression plasmid pET-17b. This 
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placed the mature PorB protein sequence in frame directly behind the T7 
promoter and 20 amino acids of the 4>10 protein including the leader sequence. 
Upon addition of IPTG to a culture of E. coli containing this plasmid, large 
amounts of PorB protein accumulated within the bacteria. A complete 
explanation for why this construction was non-lethal to the E. coli and expressed 
large amount of the porin protein, await further studies. However, one possible 
hypothesis is that by replacing the neisserial promoter and signal sequence with 
that of the T7 and (J) 10 respectively, the porin product was directed to the 
cytoplasm rather than toward the outer membrane. Henning and co-workers have 
reported that when E. coli OmpA protein and its fragments are expressed, those 
products which are found in the cytoplasm are less toxic than those directed 
toward the periplasmic space (Klose, M., et al,J. Biol Chem. 265:13291-13296 
(1988); Klose, M., et al, J. Biol Chem. 265:13297-13302 (1988); FreudK R., 
et al, J. Mol Biol 205:111-115 (1989)). Whatever the explanation, once the 
PorB protein was expressed, it was easily isolated, purified and appeared to 
reform into trimers much like the native porin. The results of the inhibition 
ELISA data using human immune sera suggests that the PorB protein obtained 
in this fashion regains most if not all of the antigenic characteristics of the wild 
type PorB protein purified from meningococci. This expression system lends 
itself to the easy manipulation of the neisserial porin gene by modern molecular 
techniques. In addition, this system allows one to obtain large quantities of pure 
porin protein for characterization. In addition, the present expression system 
allows the genes from numerous strains of Neisseria, both gonococci and 
meningococci, to be examined and characterized in a similar manner. 

The Neisseria meningitidis outer membrane class 3 protein from 
serogroup B (MB3) was also expressed in the methylotrophic yeast Pichia 
pastoris by placing the MB3 DNA fragment under the control of the strong P. 
pastoris alcohol oxidase promoter AOXL Upon induction on methanol, strains 
of P. pastoris transformed with the recombinant plasmids produced either a 
native or a fusion MB3 protein, which were reactive with mouse polyclonal 
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antibodies raised against the wild type counterpart. In shaking flask cultures, 
engineered P. pastoris yielded about 1-3 mg of expressed protein per gram of 
pelleted wet cells, or 100-600 mg per liter, which corresponded to 10-15% of the 
yeast cell suspension or about 3-5% of total cellular proteins (Table 4). Full- 
length MB 3 DNA was cloned into each of four Pichia Expression Vectors 
developed by Invitrogen. To obtain the expression of monomeric, full size 34 
kDa MB 3 protein, the intracellular pHIL-D2 vector was used. A map of the 
pHIL-D2 vector may be found on p. 19 of the Invitrogen Instruction Manual for 
the Pichia Expression Kit, Version E, the contents of which is hereby 
incorporated by reference. This construct provided maximal expression levels 
(up to 3 mg of MB3 per gram of cells) (Tables 3 and 4 ). The expressed product 
was not secreted, being mainly (95%) insoluble, and it was tightly associated with 
cell membranes. 

To further increase the possibility for the secretion of expressed MB3, 
three other vectors with different secretion signals were also used: the vector 
pHIL-Sl, which carries a native Pichia pastoris signal sequence from the acid 
phosphatase gene, PHOl, and the vectors pPIC9 and pPIC9K, which carry the 
secretion signal from the S. cerevisiae a-mating factor prepro-peptide. Maps of 
the pHIL-Sl and pPIC9 vectors may be found on pp. 21-22 of the Invitrogen 
Instruction Manual for the Pichia Expression Kit, Version E. It was found that 
the pHIL-Sl/MB3 construct provided the expression of a MB3- PHOl fusion 
polypeptide with an apparent molecular weight of 36.5 kDa. which was partly 
processed to generate mature 34 kDa MB3. About 5- 1 0% of expressed MB3 was 
secreted to the yeast growth medium, and about 40-50% of the 36.5 kDa fusion 
polypeptide was cleaved (Table 4). Pichia recombinants transformed by 
pPIC9/MB3 or pPIC9K/MB3 constructs expressed only MB 3 fused with a-factor, 
yielding a fusion polypeptide of approximately 45 kDa. There was no evidence 
of any cleavage or processing of that fusion protein. 

Preliminary studies on the isolation and purification of recombinant MB3 
(pHIL-D2/MB3 containing transformants) suggest that when expressed in P 
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pastoris, MB3 may form trimers under native conditions, and that the native 
protein is resistant to trypsin digestion. These results are similar to those which 
have been observed for the wild-type counterpart. 

An increase in the yield of expressed MB3 may be obtained by using 
5 strains of Pichia which contain multiple copies of the MB3 expression cassette. 

Strains harboring multiple copies exist naturally within transformed cell 
populations at <1 0% frequency. These strains may be identified either by directly 
screening large numbers of transformants for levels of MB 3 expression via SDS- 
PAGE or immunoblotting, or indirectly screening by "dot blot" hybridization to 

10 select for clones containing multiple copies of the MB3 gene (Cregg et al„ 

Bio/Technology 77:905-910 (1993)). Alternatively, such multiple integrated 
clones may be constructed by using a new pA0815 vector (Invitrogen), which 
allows cloning of multiple copies of the gene of interest via repeated cassette 
insertion steps {Ibid at p. 907). Scale-up procedures using a fermenter will 

15 provide higher yeast cell densities and therefore improve the yields of the 

expressed proteins by at least 5-10 times. Optimization of protein expression 
(i.e., growth media composition, buffering capacity, casamino acids 
supplementation, increase of methanol concentration, etc.) may be carried out 
with routine experimentation. 

20 Another way to identify Pichia transformants having multiple copies of 

MB3 takes advantage of the fact that the Pichia expression vector pPIC9K carries 
the kanamycin resistance gene which confers resistance to G418; in other 
respects, pPIC9K corresponds to pPIC9. Spontaneous generation of multiple 
insertion events can then be identified by the level of resistance to G41 8. Pichia 

25 transformants are selected on histidine-deficient medium and screened for their 

level of resistance to G418. An increased level of resistance to G418 indicates 
multiple copies of the kanamycin resistance gene. 

Thus, the present invention relates to a method of expressing an outer 
membrane meningococcal group B porin protein, in particular, the class 2 and 

30 class 3 porin proteins. 
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In one embodiment, the present invention relates to a method of 
expressing the outer membrane meningococcal group B porin protein in E. coli 
comprising: 

(a) transforming E. coli by a vector comprising a selectable marker 
and a gene coding for a protein selected from the group consisting 
of: 

(i) a mature porin protein, and 

(ii) a fusion protein comprising a mature porin protein fused to 
amino acids 1 to 20 or 22 of the T7 gene 4>10 capsid protein; 

wherein said gene is operably linked to the T7 promoter; 

(b) growing the transformed E. coli in a culture media containing a 
selection agent, and 

(c) inducing expression of said protein; 

wherein the protein so produced comprises more than about 2% of the total 
protein expressed in the E. coli. 

In a preferred embodiment, the meningococcal group B porin protein or 
fusion protein expressed comprises more than about 5% of the total proteins 
expressed in E. coli. In another preferred embodiment, the meningococcal group 
B porin protein or fusion protein expressed comprises more than about 10% of 
the total proteins expressed in E. coli. In yet another preferred embodiment, the 
meningococcal group B porin protein or fusion protein expressed comprises more 
than about 30% of the total proteins expressed in E. coli. 

Examples of plasmids which contain the T7 inducible promotor include 
the expression plasmids pET-17b, pET-Ila, pET-24a-d(+) and pET-9a, all of 
which are commercially available from Novagen (565 Science Drive, Madison, 
WI 5371 1). These plasmids comprise, in sequence, a T7 promoter, optionally a 
lac operator, a ribosome binding site, restriction sites to allow insertion of the 
structural gene and a T7 terminator sequence. See. the Novagen catalogue, pages 
36-43 (1993). 
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In a preferred embodiment, E. coli strain BL21 (DE3) AompA is 
employed. The above mentioned plasmids may be transformed into this strain or 
the wild-type strain BL21(DE3). E. coli strain BL21 (DE3) Lamp A is preferred 
as no OmpA protein is produced by this strain which might contaminate the 
purified porin protein and create undesirable immunogenic side effects. 

The transformed E. coli are grown in a medium containing a selection 
agent, e.g. any P-lactam to which £. coli is sensitive such as ampicillin. The pET 
expression vectors provide selectable markers which confer antibiotic resistance 
to the transformed organism. 

High level expression of meningococcal group B porin protein can be 
toxic in E. coli. Surprisingly, the present invention allows E. coli to express the 
protein to a level of at least almost 30% and as high as >50% of the total cellular 
proteins. 

In another embodiment, the present invention relates to a method of 
expressing an outer membrane meningococcal group B porin protein in yeast 
comprising: 

(a) ligating into a plasmid having a selectable marker a gene coding 
for a protein selected from the group consisting of : 

(i) a mature porin protein, and 

(ii) a fusion protein comprising a mature porin protein fused to 
a yeast secretion signal peptide; 

wherein said gene is operably linked to a yeast promoter; 

(b) transforming the plasmid containing the gene into a yeast strain; 

(c) selecting the transformed yeast by growing said yeast in a culture 
medium allowing selection of said transformed yeast; 

(d) growing the transformed yeast, and 

(e) inducing expression of said protein to give yeast containing said 
protein. 

Transformation of the yeast host may be accomplished by any one of 
several techniques that are well known by those of ordinary skill in the art. These 
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techniques include direct or liposome-mediated transformation of yeast cells 
whose cell wall has been partially or completely destroyed to form spheroplasts, 
treatment of the host with alkali cations and PEG, and freeze-thawing combined 
with PEG treatment. (See Weber et al, N unconventional Yeasts: Their Genetics 
and Biotechnological Applications, CRC Crit. Rev. Biotechnoi 7: 281, 317 
(1988) and the references cited therein, all of which are hereby fully incorporated 
by reference.) 

In another preferred embodiment, the mature porin protein or fusion 
protein expressed comprises more than about 2% of the total protein expressed 
in the yeast host. In yet another preferred embodiment, the mature porin protein 
or fusion protein expressed comprises about 3-5% of the total protein expressed 
in the yeast host. 

In another preferred embodiment, the mature porin protein is the Neisseria 
meningitidis mature outer membrane class 3 protein from serogroup B. 

In another preferred embodiment, the present invention relates to 
performing the above method of expressing the outer membrane meningococcal 
group B porin protein or fusion protein in yeast, wherein said yeast is selected 
from the group consisting of; Saccharomyces cerevisiae, Schizosaccharomyces 
pombe, Saccharomyces uvarum, Saccharomyces carlsbergensis, Saccharomyces 
diastaticus, Candida tropica/is, Candida maltosa, Candida parapsilosis, Pichia 
pastoris, Pichia farinosa t Pichia pinus, Pichia vanrijii, Pichia ferment ans, Pichia 
guilliermondil Pichia stipitis, Saccharomyces telluris, Candida utilis. Candida 
guilliermondii, Hansenuta henricii, Hansenula capsulata, Hansenula 
polymorpha, Hansenula saturnus, Lypomyces kononenkoae, Kluyveromyces 
marxianus. Candida lipolytica, Saccaromycopsis fihuligera, Saccharomycodes 
ludwigii, Saccharomyces kluyveri, Tremella me sent erica, Zygosaccharomyces 
acidofaciens, Zygosaccharomyces fermentati s Yarrowia lipolytics and 
Zygosaccharomyces sofa. Many of these yeast hosts are available from the 
American Type Culture Collection. Rockville, Md. 
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In another preferred embodiment, the nucleotide sequence of the gene 
encoding the mature porin protein or fusion protein incorporates codons which 
are optimized for yeast codon usage. In yet another preferred embodiment, the 
nucleotide sequence of the gene encoding the mature porin protein which has 
5 been optimized for yeast codon usage is the nucleotide sequence SEQ ID NO: 26. 

In another preferred embodiment, the yeast secretion signal peptide is 
selected from the group consisting of the secretion signal of the S. cerevisiae a- 
mating factor prepro-peptide and the secretion signal of the P. pastoris acid 
phosphatase gene. 

10 In another preferred embodiment, the yeast secretes the protein or fusion 

protein. 

In another preferred embodiment, the yeast promoter to which the gene 
is operably linked is selected from a group consisting of the AOX1 promoter, 
the GAPDH promoter, the PH05 promoter, the glyceraldehyde-3 -phosphate 

15 dehydrogenase (TDH3) promoter, the ADHI promoter, the MFal promoter, and 

the GAL 10 promoter. Examples of plasmids which contain the AOX1 promoter 
include the expression plasmids pHIL-D2, pHIL-S 1 , pPIC9, and pPIC9K. These 
plasmids comprise, in sequence, an AOX1 promoter, restriction sites to allow 
insertion of the structural gene, an AOX1 transcription termination fragment, an 

20 open reading frame encoding HIS4 {histidinol dehydrogenase), an ampicillin 

resistance gene, and a ColEl origin. In addition, plasmids pPIC9 and pPIC9K 
comprise the ot-factor secretion signal of S. cerevisiae % and plasmid pHIL-Sl 
comprises the PHOl secretion signal of P. pastoris. pPIC9K also includes the 
kanamycin resistance gene, which confers resistance to G418 in Pichia. The 

25 level of G418 resistance in Pichia transformants can be used to identify cells 

which have undergone multiple insertion events. This occurs at a frequency of 
1-10%. An increased level of resistance to G418 indicates the presence of 
multiple copies of the kanamycin resistance gene and of the gene of interest. See 
the Novagene catalogue, Version E, pp. 19-22 (1995). 
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In another preferred embodiment, yeast host strains having a mutation in 
a suitable marker gene which causes the strain to have specific nutritional 
requirements are employed. Expression plasmids carrying a functional copy of 
the mutated gene as well as a copy of the meningococcal group B porin protein 
or fusion protein are then transformed into the yeast host strain, and transformants 
are selected on the basis of their ability to grow on medium lacking the required 
nutrient. Examples of suitable marker genes, followed by their S. cerevisiae 
notation, include the genes encoding imidazole glycerol phosphate 
dehydrogenase (HIS3), beta-isopropylmalate dehydrogenase (LEU2\ tryptophan 
synthase (TRP5\ argininosuccinate lyase (ARG4). A'-(5'-phosphorilosyl) 
anthranilate isomerase (TRP]\ histidinol dehydrogenase (HIS4), orotidine-5- 
phosphate decarboxylase {URA3), dihydroorotate dehydrogenase (URA1), 
galactokinase (GAL J), and alpha-aminodipate reductase (LYS2). After 
transformed yeast host cells are selected on the basis of their ability to grow in 
medium lacking the appropriate nutrient, the cells are screened for integration of 
the meningococcal group B porin protein or fusion protein at the correct loci. 
This screening is performed by methods well known to those of ordinary skill in 
the art, for example, by selecting for transformants which grow slowly on 
medium which lacks the nutrient used to confirm transformation and includes 
methanol in order to induce expression of the outer membrane meningococcal 
group B porin protein or fusion protein from the AOX1 promoter. These 
transformants are then grown up in glycerol-containing medium, and expression 
of the meningococcal group B porin protein or fusion protein is then induced by 
the addition of methanol. 

In a more preferred embodiment, P. pastohs host strains GS 1 1 5 or KM7 1 
are employed. These strains have a mutation in the histidinol dehydrogenase 
gene (his4) which prevents them from synthesizing histidine. The expression 
plasmids pHIL-D2, pHIL-Sl, pPIC9, and pPIC9K carry the HIS4 gene which 
complements his4 in the host, allowing selection of transformants on histidine- 
deficient medium. After transformed P. pastoris host cells are selected in 
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histidine-deficient medium, the cells are screened for integration of the 
meningococcal group B porin protein or fusion protein at the correct loci by 
selecting for transformants which grow slowly on his \ methanol^ plates. These 
transformants, which become mutated at the AOX1 locus when the MB3 gene 
5 inserts into the host genome, are only capable of slow growth on methanol, as 

they are metabolizing methanol with the less efficient AOX2 gene product. The 
transformants are then grown up in glycerol-containing medium, and expression 
of the meningococcal group B porin protein or fusion protein is then induced by 
the addition of methanol. 

10 In a most preferred embodiment, the present invention relates to 

performing the above method of expressing the outer membrane meningococcal 
group B porin protein in yeast, wherein said yeast is Pichia pastoris. 

In another preferred embodiment, the present invention relates to a 
vaccine for inducing an immune response in an animal comprising the outer 

15 membrane meningococcal group B porin protein or fusion protein thereof, 

produced according to the above-described methods, together with a 
pharmaceutical^ acceptable diluent, carrier, or excipienL wherein the vaccine 
may be administered in an amount effective to elicit an immune response in an 
animal to Neisseria meningitidis. In a preferred embodiment, the animal is 

20 selected from the group consisting of humans, cattle, pigs, sheep, and chickens. 

In another preferred embodiment, the animal is a human. 

In another preferred embodiment, the present invention relates to the 
above-described vaccine, wherein said outer membrane meningococcal group B 
porin protein or fusion protein thereof is conjugated to a meningococcal group B 

25 capsular polysaccharide (CP). Such capsular polysaccharides may be prepared 

as described in Ashton, F.E. et al.. Microbial Pathog. 6:455-458 (1989); 
Jennings, H.J. et al, J. Immunol 73^:265 1 (1985); Jennings. H.J. et al, J. 
Immunol. 757:1708-1713 (1986); Jennings, H.J. et al., J. Immunol. /-/2:3585- 
3591 (1989); Jennings, H.J., "Capsular Polysaccharides as Vaccine Candidates," 
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in Current Topics in Microbiology and Immunology, 750:105-107 (1990); the 
contents of each of which are fully incorporated by reference herein. 

The invention also relates to a vaccine capable of simultaneously inducing 
an immune response against any one of several N. meningitidis serogroups. Thus, 
in another preferred embodiment, the invention relates to a trivalent vaccine 
comprising the capsular polysaccharides from each of three different serogroups 
of N. meningitidis. Specifically, the vaccine of the invention comprises group A 
meningococcal polysaccharide (GAMP), group B meningococcal polysaccharide 
(GBMP), and group C meningococcal polysaccharide (GCMP) antigens, together 
with a pharmaceutical ly acceptable carrier. 

In a preferred embodiment, group A meningococcal polysaccharide 
(GAMP), group B meningococcal polysaccharide (GBMP), and group C 
meningococcal polysaccharide (GCMP) antigens are each conjugated to a protein 
carrier, thus yielding GAMP, GCMP and GBMP polysaccharide antigen 
conjugates. 

Of course, it will be understood by those of ordinary skill that a number 
of carrier proteins will be suitable to be used in the polysaccharide-protein 
conjugates included in the vaccine of the invention. A suitable carrier protein 
will be one which is safe for administration to mammals, and which is 
immunologically effective as a carrier. Safety includes absence of primary 
toxicity and minimal risk of allergic complications. 

In general, any heterologous protein could serve as a carrier antigen. The 
protein may be, for example, native toxin or detoxified toxin (also termed toxoid). 
In addition, genetically altered proteins which are antigenically similar to toxins 
and yet non-toxic may be produced by mutational techniques well-known to those 
of skill in the art. Such an altered toxin is termed a "cross reacting material," or 
CRM. CRM J97 is noteworthy, because it differs from native diphtheria toxin at 
only one amino acid residue, and is immunologically indistinguishable from the 
native toxin (Anderson, P.W., Infect. Immun. 59:233-238 (1983)). 
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It will be understood by those of skill in the art that the polysaccharide- 
protein carrier conjugates of the vaccine may be produced by several different 
methods. The types of covalent bonds which couple a polysaccharide to a protein 
carrier, and the means of producing them, are well known to those of skill in the 
5 art. Details concerning the chemical means by which the two moieties can be 

linked may be found in U.S. Patent No. 5,371,197, and 4,902,506, the contents 
of which are herein incorporated by reference in their entirety. One such method 
is the reductive amination process described in Schwartz and Gray (Arch. 
Biochim. Biophys. 7^7:542-549 (1977)). This process involves reacting the 
10 reducing capsular polysaccharide fragment and bacterial toxin or toxoid in the 

presence of cyanoborohydride ions, or another reducing agent. Such a process 
will not adversely affect the toxin or toxoid or the capsular polysaccharide (U.S. 
Patent No. 4,902,506). Such a conjugation process is also described in Examples 
12-14, below. 

15 While tetanus and diphtheria toxins are the prime candidates for carrier 

proteins, owing to their history of safety, there may be overwhelming reasons, 
well known to those of ordinary skill in the art. to use another protein. For 
example, another protein may be more effective as a carrier, or production 
economics may be significant. Other candidates include toxins or toxoids of 

20 pseudomonas, staphylococcus, streptococcus, pertussis and entertoxigenic 

bacteria, including Escherichia coli. A preferred carrier protein to which the 
group B meningococcal polysaccharide may be conjugated is the class 3 porin 
protein (PorB) of group B N. meningitidis. A preferred protein carrier protein 
to which GAMP antigen and GCMP antigen may be conjugated is tetanus 

25 toxoid. 

It is known in the art that the immunogenicity of GBMP is limited in 
humans, and especially in infant humans, and that direct covalent couplings of the 
group B polysaccharide to tetanus toxoid yielded a conjugate which failed to 
induce a significant polysaccharide-specific response in either rabbits (Jennings, 
30 H.J. and Lugowski, C, J. Immunol. 727:101 1-1018 (1981)) or mice (Jennings. 
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HJ. et al, J. Immunol 7 J7: 1 708-1 71 3 (1986)). This failure prompted interest in 
the direct chemical modification of the group B polysaccharide. This was done 
with the idea of creating synthetic epitopes capable of modulating the immune 
response in such a way as to produce enhanced levels of cross-reactive B 
polysaccharide-specific antibodies (Jennings, H.J. et al, J. Immunol. 757:1708- 
1713 (1986)). 

It will be understood by those of ordinary skill in the art that in selecting 
possible chemical modifications of the group B polysaccharide (Jennings, H.J. et 
al., J. Immunol. 757:1708-1713 (1986)), two factors should be considered. First, 
the chemical modification should be able to be accomplished with facility and 
with the minimum of degradation of the polysaccharide. Second, in order to 
produce cross-reactive B polysaccharide-specific antibodies, the antigenicity of 
the modified polysaccharide to B polysaccharide-specific antibodies should be 
preserved. It will be understood by those of skill in the art that the ideal chemical 
modification of group B polysaccharide will retain both the carboxylate and the 
N-carbonyl groups (Jennings, HJ. et al, J. Immunol 757:1708-1713 (1986)). 
The most preferred modification which satisfies the above criteria is a 
modification wherein the N-acetyl groups of the sialic acid residues of the B 
polysaccharide are removed by strong base and replaced by N-propionyl groups 
(see Examples 6 and 1 4). 

In a more preferred embodiment, the N-propionylated GBMP is 
subsequently conjugated to a carrier protein. While any carrier protein which 
enhances the immunogenicity of N-propionylated GBMP may be used, a 
preferred protein carrier is the class 3 outer membrane protein of group B N. 
meningitidis (MB3, or PorB). 

Thus, in still another preferred embodiment, GBMP antigen is conjugated 
to PorB after having been N-propionylated. 

Preferably, the capsular polysaccharide (CP), which may be group A, B 
or C meningococcal polysaccharide, is isolated according to Frasch, C.E., 
"Production and Control of Neisseria meningitidis Vaccines" in Bacterial 



WO 97/28273 



PCT/US97/01687 



-34- 

Vaccines, Alan R. Liss, Inc., pages 123-145 (1990), the contents of which are 
fully incorporated by reference herein, as follows: 

Grow organisms in modified Franz medium 10 to 20 hrs 

1 Heat kill, 55 °C, lOmin 
Remove inactivated cells by centrifugation 

1 AddCetavlontoO.1% 
Precipitate CP from culture broth 

1 Add calcium chloride to 1 M 
Dissolve CP then centrifuge to remove cellular debris 

i Add ethyl alcohol to 25% 
Remove precipitated nucleic acids by centrifugation 

1 Add ethyl alcohol to 80% 
Precipitate crude CP and remove alcohol 

The crude CP is then further purified by gel filtration chromatography 
after partial depolymerization with dilute acid, e.g. acetic acid, formic acid, and 
trifluoroacetic acid (0.01-0.5 N), to give a mixture of polysaccharides having an 
average molecular weight of 10,000-20,000. Where the CP is GBMP, purified 
GBMP is then N-deacetylated with NaOH in the presence of sodium borohydride 
and N-propionylated to afford N-Pr GBMP. Thus, the CP that may be employed 
in the conjugate vaccines of the present invention may be CP fragments, N- 
deacylated CP and fragments thereof, as well as N-Pr CP and fragments thereof, 
so long as they induce active immunity when employed as part of a CP-porin 
protein conjugate (see Examples 6 and 14). 

In a further preferred embodiment, the present invention relates to a 
method of preparing a polysaccharide conjugate comprising: obtaining the 
above-described outer membrane meningococcal group B porin protein or fusion 
protein thereof; obtaining a CP from a Neisseria meningitidis organism; and 
conjugating the protein to the CP. 
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The conjugates of the invention may be formed by reacting the reducing 
end groups of the CP to primary amino groups of the porin by reductive 
amination. The reducing groups may be formed by selective hydrolysis or 
specific oxidative cleavage, or a combination of both. Preferably, the CP is 
conjugated to the porin protein by the method of Jennings et al, U.S. Patent No. 
4,356,170, the contents of which are fully incorporated by reference herein, which 
involves controlled oxidation of the CP with periodate followed by reductive 
amination with the porin protein. 

The vaccine of the present invention comprises the meningococcal group 
B porin protein, fusion protein or conjugate vaccine, or the trivalent GAMP. 
GBMP and GCMP vaccine, in an amount effective depending on the route of 
administration. Although subcutaneous or intramuscular routes of administration 
are preferred, the meningococcal group B porin protein, fusion protein or vaccine 
of the present invention can also be administered by an intraperitoneal or 
intravenous route. One skilled in the art will appreciate that the amounts to be 
administered for any particular treatment protocol can be readily determined 
without undue experimentation. Suitable amounts might be expected to fall 
within the range of 2 micrograms of the protein per kg body weight to 100 
micrograms per kg body weight. 

Thus, in a preferred embodiment, the vaccine comprises about 2 jag of the 
GAMP, GCMP and GBMP polysaccharide antigen conjugates. 

In another preferred embodiment, the vaccine comprises about 5 \xg of the 
GAMP, GCMP and GBMP polysaccharide antigen conjugates. 

In yet another referred embodiment, the vaccine comprises about 2 jig of 
the GAMP and GCMP polysaccharide antigen conjugates, and about 5 \xg of the 
GBMP polysaccharide antigen conjugate. 

The vaccine of the present invention may be employed in such forms as 
capsules, liquid solutions, suspensions or elixirs for oral administration, or sterile 
liquid forms such as solutions or suspensions. Any inert carrier is preferably 
used, such as saline, phosphate-buffered saline, or any such carrier in which the 
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meningococcal group B porin protein, fusion protein or conjugate vaccine have 
suitable solubility properties. The vaccines may be in the form of single dose 
preparations or in multi-dose flasks which can be used for mass vaccination 
programs. Reference is made to Remington's Pharmaceutical Sciences, Mack 
5 Publishing Co., Easton, PA, Osol (cd.) (1980); and New Trends and 

Developments in Vaccines^ Voller et al (eds.). University Park Press, Baltimore, 
MD (1978), for methods of preparing and using vaccines. 

The vaccines of the present invention may further comprise adjuvants 
which enhance production of porin-speciflc antibodies. Such adjuvants include, 

10 but are not limited to, various oil formulations such as Freund's complete 

adjuvant (CFA), stearyl tyrosine (ST, see U.S. Patent No. 4,258,029), the 
dipeptide known as MDP, saponin, aluminum hydroxide, and lymphatic cytokine. 

Freund's adjuvant is an emulsion of mineral oil and water which is mixed 
with the immunogenic substance. Although Freund's adjuvant is powerful, it is 

15 usually not administered to humans. Instead, the adjuvant alum (aluminum 

hydroxide) or ST may be used for administration to a human. The meningococcal 
group B porin protein or a conjugate vaccine thereof may be absorbed onto the 
aluminum hydroxide from which it is slowly released after injection. The 
meningococcal group B porin protein or group A, B and C meningococcal 

20 polysaccharide conjugate vaccine may also be encapsulated within liposomes 

according to Fullerton, U.S. Patent No. 4,235,877. 

In another preferred embodiment, the present invention relates to a 
method of inducing an immune response in an animal comprising administering 
to the animal the vaccine of the invention, produced according to methods 

25 described, in an amount effective to induce an immune response. 

In a further embodiment, the invention relates to a method of purifying 
the above-described outer membrane meningococcal group B porin protein or 
fusion protein comprising: lysing the transformed E. coli to release the 
meningococcal group B porin protein or fusion protein as part of insoluble 

30 inclusion bodies; washing the inclusion bodies with a buffer to remove 
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contaminating E. coli cellular proteins; resuspending and dissolving the inclusion 
bodies in an aqueous solution of a denaturant; diluting the resultant solution in 
a detergent; and purifying the solubilized meningococcal group B porin protein 
by gel filtration. 

The lysing step may be carried out according to any method known to 
those of ordinary skill in the art, e.g. by sonication, enzyme digestion, osmotic 
shock, or by passing through a mull press. 

The inclusion bodies may be washed with any buffer which is capable of 
solubilizing the E. coli cellular proteins without solubilizing the inclusion bodies 
comprising the meningococcal group B porin protein. Such buffers include but 
are not limited to TEN buffer (50 mM Tris HC1, 1 mM EDTA. 1 00 mM NaCl, pH 
8.0), Tricine, Bicine and HEPES. 

Denaturants which may be used in the practice of the invention include 
2 to 8 M urea or about 2 to 6 M guanidine HC1, more preferably, 4 to 8 M urea 
or about 4 to 6 M guanidine HC1, and most preferably, about 8 M urea or about 
6 M guanidine HC1. 

Examples of detergents which can be used to dilute the solubilized 
meningococcal group B porin protein include, but are not limited to, ionic 
detergents such as SDS and cetavlon (Calbiochem); non-ionic detergents such as 
Tween, Triton X, Brij 35 and octyl glucoside; and zwitterionic detergents such 
as 3,14-Zwittergent, empigen BB and Champs. 

Finally, the solubilized outer membrane meningococcal group B porin 
protein may be purified by gel filtration to separate the high and low molecular 
weight materials. Types of filtration gels include but are not limited to 
Sephacryl-300, Sepharose CL-6B, and Bio-Gel A-l .5m. The column is eluted 
with the buffer used to dilute the solubilized protein. The fractions containing the 
porin or fusion thereof may then be identified by gel electrophoresis, the fractions 
pooled, dialyzed, and concentrated. 
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Finally, substantially pure (>95%) porin protein and fusion protein may 
be obtained by passing the concentrated fractions through a Q sepharose high 
performance column. 

In another embodiment, the present invention relates to expression of the 
meningococcal group B porin protein gene which is part of a vector which 
comprises the T7 promoter, which is inducible. If a promoter is an inducible 
promoter, then the rate of transcription increases in response to an inducing agent. 
The T7 promoter is inducible by the addition of isopropyl p-D- 
thiogalactopyranoside (IPTG) to the culture medium. Alternatively, the Tac 
promotor or heat shock promotor may be employed. Preferably, the 
meningococcal group B porin protein gene is expressed from the pET-17 
expression vector or the pET-1 1 a expression vector, both of which contain the T7 
promoter. 

The cloning of the meningococcal group B porin protein gene or fusion 
gene into an expression vector may be carried out in accordance with 
conventional techniques, including blunt-ended or stagger-ended termini for 
ligation, restriction enzyme digestion to provide appropriate termini, filling in of 
cohesive ends as appropriate, alkaline phosphatase treatment to avoid undesirable 
joining, and ligation with appropriate ligases. Reference is made to Sambrook 
et ai, Molecular Cloning: A Laboratory Manual, 2nd ed.„ Cold Spring Harbor, 
New York, Cold Spring Harbor Laboratory Press (1989), for general methods of 
cloning. 

The meningococcal group B porin protein and fusion protein expressed 
according to the present invention must be properly refolded in order to achieve 
a structure which is immunologically characteristic of the native protein. In yet 
another embodiment, the present invention relates to a method of refolding the 
above-described outer membrane protein and fusion protein comprising: lysing 
the transformed cells to release the meningococcal group B porin protein or 
fusion protein as part of insoluble inclusion bodies; washing the inclusion bodies 
with a buffer to remove contaminating cellular proteins; resuspending and 
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dissolving the inclusion bodies in an aqueous solution of a denaturant; diluting 
the resultant solution in a detergent; and purifying the solubilized meningococcal 
group B porin protein or fusion protein by gel filtration to give the refolded 
protein in the eluant. Surprisingly, it has been discovered that the folded trimeric 
meningococcal group B class 2 and class 3 porin proteins and fusion proteins are 
obtained directly in the eluant from the gel filtration column. 

In another preferred embodiment, the present invention relates to a 
substantially pure refolded outer membrane meningococcal group B porin protein 
and fusion protein produced according to the above-described methods. A 
substantially pure protein is a protein that is generally lacking in other cellular 
Neisseria meningitidis components as evidenced by, for example, electrophoresis. 
Such substantially pure proteins have a purity of >95% r as measured by 
densitometry on an electrophoretic gel after staining with Coomassie blue or 
silver stains. 

The following examples are illustrative, but not limiting, of the method 
and compositions of the present invention. Other suitable modifications and 
adaptations of the variety of conditions and parameters normally encountered in 
this art which are obvious to those skilled in the art are within the spirit and scope 
of the present invention. 

Examples 

Example 1. Cloning of the Class 3 Porin Protein from Group B 
Neisseria meningitidis 

Materials and Methods 

Organisms: The Group B Neisseria meningitidis strain 8765 (B: 15:P1 ,3) 
was obtained from Dr. Wendell Zollinger (Walter Reed Army Institute for 
Research) and grown on agar media previously described (Swanson, J.L., Infect. 
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Immun. 27:292-302 (1978)) in a candle extinction jar in an incubator maintained 
at 30°C. Escherichia coli strains DME558 (from the collection of S. Benson; 
Silhavy, T.J. et aL, "Experiments with Gene Fusions," Cold Spring Harbor 
Laboratory, Cold Spring Harbor, N.Y., 1984), BRE51 (Bremer, E. et al, FEMS 
5 Microbiol Lett. 35:173-178 (1986)) and BL21(DE3) were grown on LB agar 

plates at 37°C. 

PI Transduction: A Pl v , r lysate of E. coli strain DME558 was used to 
transduce a tetracycline resistance marker to strain BRE51 (Bremer, E., et al, 
FEMS Microbiol Lett. 35:173-178 (1986)) in which the entire ompA gene had 

10 been deleted (Silhavy, T.J., et al, Experiments with Gene Fusions, Cold Spring 

Harbor Laboratory, Cold Spring Harbor, NY (1984)). Strain DME558, 
containing the tetracycline resistance marker in close proximity of the ompA 
gene, was grown in LB medium until it reached a density of approximately 0.6 
OD at 600 nm. One tenth of a milliliter of 0.5 M CaCL was added to the 10 ml 

15 culture and 0.1 ml of a solution containing 1 x 10 9 PFU of Pl v/r The culture was 

incubated for 3 hours at 37 °C. After this time, the bacterial cell density was 
visibly reduced. 0.5 ml of chloroform was added and the phage culture stored at 
4°C. Because typically 1-2% of the E. coli chromosome can be packaged in each 
phage, the number of phage generated covers the entire bacterial host 

20 chromosome, including the tetracycline resistance marker close to the ompA 

gene. 

Next, strain BRE51, which lacks the ompA gene, was grown in LB 
medium overnight at 37 °C. The overnight culture was diluted 1 :50 into fresh LB 
and grown for 2 hr. The cells were removed by centrifugation and resuspended 

25 in MC salts. 0.1 ml of the bacterial cells were mixed with 0.05 of the phage 

lysate described above and incubated for 20 min. at room temperature. 
Thereafter, an equal volume of 1 M sodium citrate was added and the bacterial 
cells were plated out onto LB plates containing 12.5 jig/ml of tetracycline. The 
plates were incubated overnight at 37°C. Tetracycline resistant (12 |ig/ml) 

30 transductants were screened for lack of OmpA protein expression by SDS-PAGE 
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and Western Blot analysis, as described below. The bacteria resistant to the 
antibiotic have the tetracycline resistance gene integrated into the chromosome 
very near where the ompA gene had been deleted from this strain. One particular 
strain was designated BRE-T R . 

A second round of phage production was then carried out with the strain 
BRE-T R , using the same method as described above. Representatives of this 
phage population contain both the tetracycline resistance gene and the OmpA 
deletion. These phage were then collected and stored. These phage were then 
used to infect E. coli BL21(DE3). After infection, the bacteria contain the 
tetracycline resistance marker. In addition, there is a high probability that the 
OmpA deletion was selected on the LB plates containing tetracycline. 

Colonies of bacteria which grew on the plates were grown up separately 
in LB medium and tested for the presence of the OmpA protein. Of those 
colonies selected for examination, all lacked the OmpA protein as judged by 
antibody reactivity on SDS-PAGE western blots. 

SDS-PAGE and Western Blot: The SDS-PAGE was a variation of 
Laemmli's method (Laemmli, U.K., Nature 227:680-685 (1970)) as described 
previously (Blake and Gotschlich, J. Exp. Med. 759:452-462 (1984)). 
Electrophoretic transfer to Immobilon P (Millipore Corp. Bedford, MA) was 
performed according to the methods of Towbin ef al. (Towbin, II., et al* Proc. 
Natl Acad. Sci USA 76:4350-4354 (1979)) with the exception that the paper was 
first wetted in methanol. The Western blots were probed with phosphatase 
conjugated reagents (Blake, M.S., et ai.Analyt. Biochem. 136: 1 75-1 79 (1984)). 

Polymerase Chain Reaction: The method described by Feavers et al. 
(Feavers, I.M., eta!., Infect, lmmun. (50:3620-3629 (1992)) was used to amplify 
the gene encoding the PorB. The primers selected were primers 33 (GGG GTA 
GAT CTG CAG GTT ACC TTG TAC GGT ACA ATT AAA GCA GGC GT) 
and 34 (GGG GGG GTG ACC CTC GAG TTA GAA TTT GTG ACG CAG 
ACC A AC) as previously described (Feavers, I.M., et al. Infect, lmmun. 
(50:3620-3629 (1992)). Briefly, the reaction components were as follows: 
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Meningococcal strain 8765 chromosomal DNA (100 ng/|il), 1 \x\\ 5' and 3' 
primers (1 |iM) 2 pi each; dNTP (10 mM stocks), 4 |il each; 10 X PCR reaction 
buffer (100 mM Tris HC1, 500 mM KC1, pH 8.3), 10 pi; 25 mM MgCl 2 , 6 pi; 
double distilled H 2 0, 62 pi; and Taq polymerase (Cetus Corp., 5 u/jil), 1 pi. The 
5 reaction was carried out in a GTC-2 Genetic Thermocycler (Precision Inst. Inc, 

Chicago, IL) connected to a Lauda 4/K methanol/water cooling system 
(Brinkman Instruments, Inc., Westbury, NY) set at 0°C. The thermocycler was 
programmed to cycle 30 times through: 94°C, 2 min.; 40°C, 2 min.; and 72 °C\ 
3 min. At the end of these 30 cycles, the reaction was extended at 72 °C for 3 min 
10 and finally held at 4°C until readied for analysis on a 1% agarose gel in TAE 

buffer as described by Maniatis (Maniatis, T., et ai. Molecular Cloning, A 
Laboratory Manual. Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 
(1982)). 

Subcloning of the PCR product: The pET-17b plasmid (Novagen, Inc.) 

15 was used for subcloning and was prepared by double digesting the plasmid with 

the restriction endonucleases BamHl and Xhol (New England Biolabs, Inc., 
Beverly, MA). The digested ends were then dephosphorylated with calf intestinal 
alkaline phosphatase (Boehringer Mannheim, Indianapolis, IN). The digested 
plasmid was then analyzed on a 1% agarose gel, the cut plasmid removed, and 

20 purified using the GeneClean kit (Biol OK La Jolla, CA). The PCR product was 

prepared by extraction with phenol-chloroform, chloroform, and finally purified 
using the GeneClean Kit (Bio 101). The PCR product was digested with 
restriction endonucleases Bglll and Xhol (New England Biolabs, Inc.). The 
DNA was then extracted with phenol-chloroform, precipitated by adding 0.1 

25 volumes of 3 M sodium acetate, 5 fil glycogen (20 pg/pl). and 2.5 volumes of 

ethanol. After washing the DNA with 70% ethanol (vol/vol), it was redissolved 
in TE buffer. The digested PCR product was ligated to the double digested pET- 
17b plasmid described above using the standard T4 ligase procedure at 16°C 
overnight (Current Protocols in Molecular Biology^ John Wiley & Sons, New 

30 York (1993)). The ligation product was then transformed into the BL21 (DE3)- 
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AompA described above which were made competent by the method of Chung 
etal (Chung, C.T., etai, Proa Natl. Acad Set USA £(5:2172-2175 (1989)). The 
transformants were selected on LB plates containing 50 ng/ml carbenicillin and 
12jag/ml tetracycline. Several transformants were selected, cultured in LB both 
containing carbenicillin and tetracycline for 6 hours at 30 °C. and plasmid gene 
expression inducted by the addition of IPTG. The temperature was raised to 
37°C and the cultures continued for an additional 2 hrs. The cells of each culture 
were collected by centrifugation, whole cell lysates prepared, and analyzed by 
SDS-PAGE and Western Blot using a monoclonal antibody (4D1 1) which reacts 
with all neisserial porins. 

Nucleotide Sequence Analysis: The nucleotide sequences of the cloned 
Class 3 porin gene DNA were determined by the dideoxy method using denatured 
double-stranded plasmid DNA as the template as described {Current Protocols 
in Molecular Biology, John Wiley & Sons, New York ( 1 993)). Sequenase II kits 
(United States Biochemical Corp., Cleveland, OH) were used in accordance with 
the manufacturer's instructions. The three synthesized oligonucleotide primers 
(Operon Technologies, Inc., Alameda, CA) were used for these reactions. One 
for the 5' end, which consisted of 5'TCAAGCTTGGTACCGAGCTC and two 
for the 3' end, 5TTTGTTAGCAGCCGGATCTG and 5' 
CTCAAGACCCGTTTAGAGGCC. Overlapping, nested deletions were made 
by linearizing the plasmid DNA by restriction endonuclease Bpu\\02\ and the 
ends blunted by the addition of Thio-dNTP and Klenow polymerase {Current 
Protocols in Molecular Biology, John Wiley & Sons, New York (1993)). The 
linearized plasmid was then cleaved with restriction endonuclease Xliol and the 
exoII/Mung bean nuclease deletion kit used to make 3' deletions of the plasmid 
(Stratagene. Inc., La Jolla, CA) as instructed by the supplier. A map of this 
strategy is shown in Figure 1 . 

Expression and purification of the PorB gene product: Using a sterile 
micropipette tip, a single colony of the BL21 (DE3)-Aom/^A containing the PorB- 
pET-17b plasmid was selected and inoculated into 10 ml of LB broth containing 
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50 \xg/ml carbenicillin. The culture was incubated overnight at 30 °C while 
shaking. The 10 ml overnight culture was then sterilely added to 1 liter of LB 
broth with the same concentration of carbenicillin, and the culture continued in 
a shaking incubator at 37 °C until the OD 600 reached 0.6-1 .0. Three mis of a stock 
5 solution of IPTG (100 mM) was added to the culture and the culture incubated for 

an additional 30 min. Rifampicin was then added (5.88 ml of a stock solution; 
34 mg/ml in methanol) and the culture continued for an additional 2 hrs. The 
cells were harvested by centrifugation at 10,000 rpm in a GS3 rotor for 10 min 
and weighed. The cells were thoroughly resuspended in 3 ml of TEN buffer (50 

10 mM Tris HC1, 1 mM Tris HC1, 1 mM EDTA, 100 mM NaCl, pH 8.0) per gram 

wet weight of cells. To this was added 8 \xl of PMSF stock solution (50 mM in 
anhydrous ethanol) and 80 \xl of a lysozyme stock solution (10 mg/ml in water) 
per gram wet weight of cells. This mixture was stirred at room temperature for 
20 min. While stirring, 4 mg per gram wet weight of cells of deoxycholate was 

15 added. The mixture was placed in a 37°C water bath and stirred with a glass rod. 

When the mixture became viscous, 20 ^1 of DNase I stock solution (! mg/ml) 
was added per gram weight wet cells. The mixture was then removed from the 
water bath and left at room temperature until the solution was no longer viscous. 
The mixture was then centrifuged at 15,000 rpm in a SS-34 rotor for 20 min at 

20 4°C. The pellet was retained and thoroughly washed twice with TEN buffer. 

The pellet was then resuspended in freshly prepared TEN buffer containing 0.1 
mM PMSF and 8 M urea and sonicated in a bath sonicator (Heat Systems, Inc., 
Plain view. NY). The protein concentration was determined using a BCA kit 
(Pierce, Rockville, IL) and the protein concentration adjusted to less than 10 

25 mg/ml using the TEN-urea buffer. The sample was then diluted 1 : 1 with 10% 

(weight/vol) Zwittergent 3,14 (Calbiochem, La Jolku CA), sonicated, and loaded 
onto a Sephacryl S-300 molecular sieve column. The Sephacryl S-300 column 
(2.5 cm x 200 cm) had previously equilibrated with 100 mM Tris HCK 200 mM 
NaCl, 10 mM EDTA, 0.05% Zwittergent 3 J 4, and 0.02% azide. pH 8.0. The 

30 column flow rate was adjusted to 8 ml/hr and 1 0 ml fractions were collected. The 
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OD 280 of each fraction was measured and SDS-PAGE analysis performed on 
protein containing fractions. 

Inhibition ELISA Assays: Microtiter plates (Nunc-Immuno Plate IIF, 
Nunc, Inc., Naperville, IL) were sensitized by adding 0.1 ml per well of porB (2 
Hg/ml) purified from the wild type strain 8765, in 0.1 M carbonate buffer, pH 9.6 
with 0.02% azide. The plates were incubated overnight at room temperature. 
The plates were washed five times with 0.9% NaCl, 0.05% Brij 35, 10 mM 
sodium acetate pH 7.0, 0.02% azide. Human immune sera raised against the 
Type 15 Class 3 PorB protein was obtained from Dr. Phillip O. Livingston, 
Memorial-Sloan Kettering Cancer Center, New York, N.Y. The human immune 
sera was diluted in PBS with 0.5% Brij 35 and added to the plate and incubated 
for 2 hr at room temperature. The plates were again washed as before and the 
secondary antibody, alkaline phosphatase conjugated goat anti-human IgG (Tago 
Inc., Burlingame, CA), was diluted in PBS-Brij, added to the plates and incubated 
for 1 hr at room temperature. The plates were washed as before and 
/?-nitrophenyl phosphate (Sigma Phosphatase Substrate 104) (1 mg/ml) in 0.1 
diethanolamine, 1 mM MgCl 2 , 0.1 mM ZnCl 2 , 0.02% azide, pH 9.8, was added. 
The plates were incubated at 37 °C for 1 h and the absorbance at 405 nm 
determined using an Elida-5 microtiter plate reader (Physica. New York, NY). 
Control wells lacked either the primary and/or secondary antibody. This was 
done to obtain a titer for each human serum which would give a half-maximal 
reading in the ELISA assay. This titer for each human serum would be used in 
the inhibition ELISA. The ELISA microtiter plate would be sensitized with 
purified wild type PorB protein and washed as before. In a separate V-96 
polypropylene microtiter plate (Nunc, Inc.), varying amounts of either purified 
wild type PorB protein or the purified recombinant PorB protein were added in 
a total volume of 75 jil. The human sera were diluted in PBS-Brij solution to 
twice their half maximal titer and 75 ^1 added to each of the wells containing the 
PorB or recombinant PorB proteins. This plate was incubated for 2 hr at room 
temperature and centrifuged in a Sorvall RT6000 refrigerated centrifuge, 
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equipped with microtitcr plate carriers (Wilmington, DE) at 3000 rpm for 1 0 min. 
Avoiding the V-bottom, 100 from each well was removed and transferred to 
the sensitized and washed ELISA microtiter plate. The ELISA plates are 
incubated for an additional 2 hr, washed, and the conjugated second antibody 
added as before. The plate is then processed and read as described. The 
percentage of inhibition is then processed and read as described. The percentage 
of inhibition is calculated as follows: 

1 - (ELISA value with either PorB or rPorB protein added) 

x 100 

(ELISA value without the porB added) 



Results 



Polymerase Chain Reaction and Subcloning: A method to easily clone, 
genetically manipulate, and eventually obtain enough pure porin protein from any 
number of different neisserial porin genes for further antigenic and biophysical 
characterization has been developed. The first step toward this goal was cloning 
the porin gene from a Neisseria. Using a technique originally described by 
Feavers, et al (Feavers, I.M., et a/., Infect. Immun. 60:3620-3629 (1992)), the 
DNA sequence of the mature porin protein from a class 3, serotype 1 5 porin was 
amplified using the chromosome of meningococcal strain 8765 as a template for 
the PCR reaction. Appropriate endonuclease restriction sites had been 
synthesized onto the ends of the oligonucleotide primers, such that when cleaved, 
the amplified mature porin sequence could be directly ligated and cloned into the 
chosen expression plasmid. After 30 cycles, the PCR products shown in Figure 
2 were obtained. The major product migrated between 900bp and lOOObp which 
was in accord with the previous study (Feavers, I.M., et a/., Infect. Immun. 
(50:3620-3629 (1992)). However, a higher molecular weight product was not 
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seen, even though the PCR was conducted under low annealing stringencies 
(40°C; 50 mM KC1). 

To be able to produce large amounts of the cloned porin protein, the 
tightly controlled expression system of Studier, el al (Studier and Moffatt, J. 
Mol Biol 759:1 13-130 (1986)) was employed, which is commercially available 
through Novagen Inc. The amplified PCR product was cloned into the BamHl- 
Xhol site of plasmid pET-17b. This strategy places the DNA sequence for the 
mature porin protein in frame directly behind the T7 promoter, the DNA sequence 
encoding for the 9 amino acid leader sequence and 1 1 amino acids of the mature 
<J) 10 protein. The Studier E, col: strain BL21 lysogenic for the DE3 lambda 
derivative (Studier and Moffatt, J. Mol Biol 189\\ 13-130 (1986)) was selected 
as the expression host for the pET-1 7b plasmid containing the porin gene. But 
because it was thought that the OmpA protein, originating from the E. coli 
expression host, might tend to co-purify with the expressed meningococcal porin 
protein, a modification of this strain was made by PI transduction which 
eliminated the ompA gene from this strain. Thus, after restriction endonuclease 
digestion of both the PCR product and the pET-17b vector and ligation, the 
product was transformed into BL21(DE3)-Ar>/n/?A and transformants selected for 
ampicillin and tetracycline resistance. The restriction map of pET-1 7b is shown 
in Figure 1 1 A, while the nucleotide sequence between the Bgll\ and Xhol sites of 
pET-17b is shown in Figure 1 IB. Of the numerous colonies observed on the 
selection plate, 10 were picked for further characterization. All ten expressed 
large amounts of a protein, which migrated at the approximate molecular weight 
of the PorB protein, when grown to log phase and induced with IPTG. The whole 
cell lysate of one such culture is shown in Figure 3a. The western blot analysis 
with the 4D11 monoclonal antibody further suggested that the protein being 
expressed was the PorB protein (Figure 3b). As opposed to other studies, when 
neisserial porins have been cloned and expressed in E. coli, the host bacterial cells 
showed no signs of any toxic or lethal effects even after the addition of the IPTG. 
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The E. coli cells appeared viable and could be recultured at any time throughout 
the expression phase. 

Nucleotide sequence analysis: The amount of PorB expressed in these 
experiments was significantly greater than that previously observed and there 
5 appeared to be no adverse effects of this expression on the host E. colL To be 

certain that no PCR artifacts had been introduced into the meningococcal porin 
gene to allow for such high expression, the entire cj> 1 0 porin fusion was 
sequenced by double stranded primer extension from the plasmid. The results are 
shown in Figure 4. The nucleotide sequence was identical with another 

10 meningococcal serotype 15 PorB gene sequence previously reported by Heckels, 

et al (Ward, M.J., et al, FEMS Microbiol. Lei!. 73:283-289 (1992)) with two 
exceptions which are shown. These two nucleotide differences each occur in the 
third position of the codon and would not alter the amino acid sequence of the 
expressed protein. Thus, from the nucleotide sequence, there did not appear to 

15 be any PCR artifact or mutation which might account for the high protein 

expression and lack of toxicity within the E. coli. Furthermore, this data would 
suggest that a true PorB protein was being produced. 

Purification of the expressed porB gene product: The PorB protein 
expressed in the E. coli was insoluble in TEN buffer which suggested that when 

20 expressed, the PorB protein formed into inclusion bodies. However, washing of 

the insoluble PorB protein with TEN buffer removed most of the contaminating 
E. coli proteins. The PorB protein could then be solubilized in freshly prepared 
8M urea and diluted into the Zwittergent 3,14 detergent. The final purification 
was accomplished, using a Sephacryl S-300 molecular sieve column which not 

25 only removed the urea but also the remaining contaminating proteins. The 

majority of the PorB protein eluted from the column having the apparent 
molecular weight of trimers much like the wild type PorB. The comparative 
elution patterns of both the wild type and the PorB expressed in the E. coli are 
shown in Figure 5. It is important to note that when the PorB protein 

30 concentration in the 8 M urea was in excess of 10 mg/ml prior to dilution into the 
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Zwittergent detergent, the relative amounts of PorB protein found as trimers 
decreased and appeared as aggregates eluting at the void volume. However, at 
protein concentrations below 10 mg/ml in the urea buffer, the majority of the 
PorB eluted in the exact same fraction as did the wild type PorB. It was also 
determined using a T7-Tag monoclonal antibody and western blot analysis that 
the 1 1 amino acids of the mature T7 capsid protein were retained as the amino 
terminus. The total yield of the meningococcal porin protein from one liter of 
E. coli was approximately 50 mg. 

Inhibition ELISA Assays. In order to determine if the purified trimeric 
recombinant PorB had a similar antigenic conformation as compared to the PorB 
produced in the wild type meningococcal strain 8765, the sera from six patients 
which had been vaccinated with the wild type meningococcal Type 15 PorB 
protein were used in inhibition ELISA assays. In the inhibition assay, antibodies 
reactive to the native PorB were competitively inhibited with various amounts of 
either the purified recombinant PorB or the homologous purified wild type PorB. 
The results of the inhibition with the homologous purified PorB of each of the six 
human sera and the mean inhibition of these sera are shown in Figure 6. The 
corresponding inhibition of these sera with the purified recombinant PorB is seen 
in Figure 6B. A comparison of the mean inhibition from Figure 6 and 7 are 
plotted in Figure 8. These data would suggest that the antibodies contained in the 
sera of these six patients found similar epitopes on both the homologous purified 
wild type PorB and the purified recombinant PorB. This gave further evidence 
that the recombinant PorB had regained most if not all of the native conformation 
found in the wild type PorB. 
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Example 2. Cloning of the Class 2 Porin from Group B 
Neisseria Meningitidis strain BNCV M986 

Genomic DNA was isolated from approximately 0.5g of Group B 
Neisseria meningitidis strain BNCV M986 (serotype 2a) using previously 
5 described methods (Sambrook ei al. t Molecular Cloning: A Laboratory Manual, 

2nd ed., Cold Spring Harbor, New York, Cold Spring Harbor Laboratory Press 
(1989)). This DNA then served as the template for two class 2 porin specific 
oligonucleotides in a standard PCR reaction. These oligonucleotides were 
designed to be complementary to the 5' and 3' flanking regions of the class 2 

10 porin and to contain EcoRl restriction sites to facilitate the cloning of the 

fragment. The sequences of the oligonucleotides were as follows: 
5' AGC GGC TTG GAA TTC CCG GCT GGC TTA A AT TTC 3' and 
5' CAA ACG AAT GAA TTC AAA TAA AAA AGC CTG 3'. 
The polymerase chain reaction was then utilized to obtain the class 2 porin. The 

15 reaction conditions were as follows: BNCV M986 genomic DNA 200ng, the two 

oligonucleotide primers described above at 1 |iM of each, 200 |aM of each dNTP, 
PCR reaction buffer (10 mM Tris HC1, 50 mM KCl pH 8.3), 1 .5 mM MgCk and 
2.5 units of Tag polymerase, made up to 100 \xl with distilled H-,0. This reaction 
mixture was then subjected to 25 cycles of 95 °C for 1 min, 50°C for 2 min and 

20 72°C for 1.5 min. At the end of the cycling period, the reaction mixture was 

loaded on a 1% agarose gel and the material was electrophoresed for 2h after 
which the band at 1.3 kb was removed and the DNA recovered using the Gene 
Clean kit (Bio 101). This DNA was then digested with £coRI, repurified and 
ligated to EcoRI digested pUC19 using T 4 DNA ligase. The ligation mixture was 

25 used to transform competent E. coli DH5a. Recombinant plasmids were selected 

and sequenced. The insert was found to have a DNA sequence consistent with 
that of a class 2 porin. See, Murakami, K. et al., Infect. Immun. 57:2318-2323 
(1989). 
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The plasmid pET-17b (Novagen) was used to express the class 2 porin. 
As described below, two plasmids were constructed that yielded two different 
proteins. One plasmid was designed to produce a mature class 2 porin while the 
other was designed to yield a class 2 porin fused to 20 amino acids from the T7 
gene cj> 1 0 capsid protein. 

Construction of the mature class 2 porin 

The mature class 2 porin was constructed by amplifying the pUC19-class 
2 porin construct using the oligonucleotides: 5'-CCT GTT GCA GCA CAT ATG 
GAC GTT ACC TTG TAC GGT ACA ATT AAA GC-3' and 5 '-CGA CAG 
GCT TTT TCT CGA GAC CAA TCT TTT CAG -3 '. This strategy allowed the 
cloning of the amplified class 2 porin into the AWel and A7?ol sites of the plasmid 
pET-1 7b thus producing a mature class 2 porin. Standard PCR was conducted 
using the pUC19-class 2 as the template and the two oligonucleotides described 
above. This PCR reaction yielded a 1.1 kb product when analyzed on a 1.0% 
agarose gel. The DNA obtained from the PCR reaction was gel purified and 
digested with the restriction enzymes AWel and Xho\. The 1 . 1 kb DNA produced 
was again gel purified and ligated to N<Je\ and A7?ol digested pET-17b using T 4 
DNA ligase. This ligation mixture was then used to transform competent E. coli 
DH5a. Colonies that contained the 1 .lkb insert were chosen for further analysis. 
The DNA from the DH5a clones was analyzed by restriction mapping and the 
cloning junctions of the chosen plasmids were sequenced. After this analysis, the 
DNA obtained from the DH5a clones was used to transform £. coli BL2 1 (DE3)- 
AompA. The transformants were selected to LB-agar containing 100 Mg/ml of 
carbenicillin. Several transformants were screened for their ability to make the 
class 2 porin protein. This was done by growing the clones in LB liquid medium 
containing 100 |Lig/mI of carbenicillin and 0.4% glucose at 30°C to OD 60() - 0.6 
then inducing the cultures with IPTG (0.4 mM), The cells were then disrupted 
and the cell extract was analyzed by SDS-PAGE. The nucleotide sequence and 



WO 97/28273 



PCT7US97/01687 



-52- 

translated amino acid sequence of the mature class II porin gene cloned into pET- 
1 7b are shown in Figures 9 A and 9B. 

Construction of the fusion class 2 porin 

The fusion class 2 porin was constructed by amplifying the pUC19-class 
2 porin construct using the oligonucleotides: 5'-CCT GTT GCA GCG GAT CCA 
GAC GTT ACC TTG TAC GGT AC A ATT AAA GC- 3' and 5'-CGA CAG 
GCT TTT TCT CGA GAC CAA TCT TTT CAG -3'. This strategy allowed the 
cloning of the amplified class 2 porin into the BamYll and Xhol sites of the 
plasmid pET-17b thus producing a fusion class 2 porin containing an additional 
22 amino acids at the N-terminus derived from the T7 (J) 10 capsid protein 
contained in the plasmid. Standard PCR was conducted using the pUC19-class 
2 as the template and the two oligonucleotides described above. The PCR 
reaction yielded a 1 . 1 kb product when analyzed on a 1 .0% agarose gel. The DNA 
obtained from the PCR reaction was gel purified and digested with the reaction 
enzymes BamYll and Xhol. The 1 . 1 kb product produced was again gel purified 
and ligated to BamYll and Xhol digested pET-17b using T 4 DNA ligase. This 
ligation mixture was then used to transform competent E. coli DH5a. Colonies 
that contained the 1.1 kb insert were chosen for further analysis. The DNA from 
the DH5a clones was analyzed by restriction enzyme mapping and the cloning 
junctions of the chosen plasmids were sequenced. The nucleotide sequence and 
translated amino acid sequence of the fusion class II porin gene cloned into the 
expression plasmid pET-17b are shown in Figures 10A and 10B. After this 
analysis, the DNA obtained from the DH5a clones was used to transform E. coli 
BL2 1 (DE3)- AompA. The transformants were selected on LB-agar containing 1 00 
|ig/ml of carbenicillin. Several transformants were screened for their ability to 
make the class 2 porin protein. This was done by growing the clones in LB liquid 
medium containing 100 jig/ml of carbenicillin and 0.4% glucose at 30°C to OD 6(K) 
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= 0.6 then inducing the cultures with IPTG (0.4 mM). The cells were then 
disrupted and the cell extract was analyzed by SDS-PAGE. 

Example 5. Cloning and Expression of the Mature class 3 porin 
from Group B Neisseria meningitidis strain 8765 in 
E. coli 

Genomic DNA was isolated from approximately 0.5 g of Group B 
Neisseria meningitidis strain 8765 using the method described above (Sambrook 
etal, Molecular Cloning: A Laboratory Manual 2nd ed.. Cold Spring Harbor, 
New York, Cold Spring Harbor Laboratory Press ( 1 989)). This DNA then served 
as the template for two class 3 porin specific oligonucleotides in a standard PCR 
reaction. 

The mature class 3 porin was constructed by amplifying the genomic 
DNA from 8765 using the oligonucleotides: 5'-GTT GCA GCA CATATG GAC 
GTT ACC CTG TAC GGC ACC-3' and 5'-GGG GGG ATG GAT CCA GAT 
TAG A AT TTG TGG CGC AG A CCG AC A CC-3'. This strategy allowed the 
cloning of the amplified class 3 porin into the Nde\ and BamHl sites of the 
plasmid pET-24a+ (Figures 13A and 13B), thus producing a mature class 3 porin. 
Standard PCR was conducted using the genomic DNA isolated from 8765 as the 
template and the two oligonucleotides described above. 

The reaction conditions were as follows: 8765 genomic DNA 200 ng, the 
two oligonucleotide primers described above at 1 ^iM of each, 200 jiM of each 
dNTP, PCR reaction buffer (10 mM Tris HCI, 50 mM KCJ, pH 8.3), 1.5 mM 
MgCl 2 , and 2.5 units of Taq polymerase, and made up to 100 ^1 with distilled 
water. This reaction mixture was then subjected to 25 cycles of 95 °C for 1 min, 
50°C for 2 min and 72°C for 1 .5 min. 

This PCR reaction yielded about 930 bp of product, as analyzed on a 1% 
agarose gel. The DNA obtained from the PCR reaction was gel purified and 
digested with the restriction enzymes Ndel and BamHl. The 930 bp product was 
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again gel purified and ligated to Ndel and BamHl digested pET-24a(+) using T4 
ligase. This ligation mixture was then used to transform competent £*. coli DH5a. 
Colonies that contained the 930 bp insert were chosen for further analysis. The 
DNA from the E. coli DH5ct clones was analyzed by restriction enzyme mapping 
and cloning junctions of the chosen plasmids were sequenced. After this analysis, 
the DNA obtained from the £. coli DH5a clones was used to transform E. coll 
BL21 (DE3)- AompA. The transformants were selected on LB-agar containing 50 
Hg/ml of kanamycin. Several transformants were screened for their ability to 
make the class 3 porin protein. This was done by growing the clones in LB liquid 
medium containing 50 |ag/ml of kanamycin and 0.4% of glucose at 30°C to OD 60() 
= 0.6 then inducing the cultures with IPTG (1 mM). The cells were then 
disrupted and the cell extract was analyzed by SDS-PAGE. 

Example 4. Purification and refolding of recombinant class 2 
porin 

E coli strain BL21(DE3)Ao/7?/?A [pNV-5] is grown to mid-log phase (OD 
= 0.6 at 600 run) in Luria broth at 30°C. IPTG is then added (0.4 mM final) and 
the cells grown an additional two hours at 37 °C. The cells were then harvested 
and washed with several volumes of TEN buffer (50 mM Tris-HCK 0.2 M NaCl, 
10 mM EDTA, pH = 8.0) and the cell paste stored frozen at -75 °C. 

For purification preweighed cells are thawed and suspended in TEN 
buffer at a 1:15 ratio (g/v). The suspension is passed through a Stansted cell 
disrupter (Stansted fluid power Ltd.) twice at 8,000 psi. The resultant solution 
is then centrifuged at 1 3,000 rpm for 20 min and the supernatant discarded. The 
pellet is then twice suspended in TEN buffer containing 0.5% deoxycholate and 
the supernatants discarded. The pellet is then suspended in TEN buffer 
containing 8 M deionized urea (electrophoresis grade) and 0.1 mM PMSF (3 
g/lOml). The suspension is sonicated for 10 min or until an even suspension is 
achieved. 10 ml of a 10% aqueous solution of 3 J4-zwittergen (Calbiochem) is 
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added and the solution thoroughly mixed. The solution is again sonicated for 10 
min. Any residual insoluble material is removed by centrifugation. The protein 
concentration is determined and the protein concentration adjusted to 2 mg/ml 
with 8 M urea- 10% zwittergen buffer (1 :1 ratio). 

This mixture is then applied to a 2.6 x 1 00 cm column of Sephacryl S-300 
equilibrated in 100 mM Tris-HCl, 1 M NaCl, 10 mM EDTA, 20 mM CaCU, 
0.05% 3,14-zwittergen, 0.02% sodium azide, pH - 8.0. The flow rate is 
maintained at 1 ml/min. Fractions of 10 ml are collected. The porin refolds into 
trimer during the gel filtration. The OD = 280 nm of each fraction is measured 
and those fractions containing protein are subjected to SDS gel electrophoresis 
assay for porin. Those fractions containing porin are pooled. The pooled 
fractions are either dialyzed or diluted 1 : 10 in 50 mM Tris HC1 pH = 8.0, 0.05% 
3,14-zwittergen, 5 mM EDTA, 0.1 M NaCl. The resulting solution is then 
applied to a 2.6 x 10 cm Q sepharose high performance column (Pharmacia) 
equilibrated in the same buffer. The porin is eluted with a linear gradient of 0.1 
to 1 M NaCl. 

Example 5. Purification and refolding of recombinant class 3 
porin 

E coli strain BL21 (DE3) AompA containing the porB-pET-1 7b plasmid 
is grown to mid-log phase (OD = 0.6 at 600 nm) in Luria broth at 30 °C. IPTG 
is then added (0.4 mM final) and the cells grown an additional two hours at 37°C. 
The cells were then harvested and washed with several volumes of TEN buffer 
(50 mM Tris-HCl, 0.2 M NaCl, 10 mM EDTA, pH - 8.0) and the cell paste 
stored frozen at -75 °C. 

For purification about 3 grams of cells are thawed and suspended in 9 ml 
of TEN buffer. Lysozyme is added (Sigma, 0.25 mg/ml) deoxycholate (Sigma, 
1.3 mg/ml) plus PMSF (Sigma, ng/ml) and the mixture gently shaken for one 
hour at room temperature. During this time, the cells lyse and the released DNA 
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causes the solution to become very viscous. DNase is then added (Sigma. 2 
|ag/ml) and the solution again mixed for one hour at room temperature. The 
mixture is then centrifuged at 15K rpm in a S-600 rotor for 30 minutes and the 
supernatant discarded. The pellet is then twice suspended in 1 0 ml of TEN buffer 
5 and the supernatants discarded. The pellet is then suspended in 1 0 ml of 8 M urea 

(Pierce) in TEN buffer. The mixture is gently stirred to break up any clumps. 
The suspension is sonicated for 20 minutes or until an even suspension is 
achieved. 10 ml of a 10% aqueous solution of 3,14-zwittergen (Calbiochem) is 
added and the solution thoroughly mixed. The solution is again sonicated for 10 

10 minutes. Any residual insoluble material is removed by centrifugation. The 

protein concentration is determined and the protein concentration adjusted to 2 
mg/ml with 8 M urea- 10% zwittergen buffer (1:1 ratio). 

This mixture is then applied to a 1 80 x 2.5 cm column of Sephacryl S-300 
(Pharmacia) equilibrated in 100 mM Tris-HCL 1 M NaCI. 1 0 mM EDTA, 20 mM 

15 CaCl 2 , 0.05% 3,14-zwittergen, pH = 8.0. The flow rate is maintained at 1 

ml/min. Fractions of 10 ml are collected. The porin refolds into trimer during the 
gel filtration. The OD 2S o nm of each fraction is measured and those fractions 
containing protein are subjected to SDS gel electrophoresis assay for porin. 
Those fractions containing porin are pooled. 

20 The pooled fractions are dialyzed and concentrated 4-6 fold using Amicon 

concentrator with a PM 10 membrane against buffer containing 100 mM Tris- 
HC1, 0.1 M NaCl, 10 mM EDTA, 0.05% 3,14-zwittergen, pH = 8.0. 
Alternatively, the pooled fractions are precipitated with 80% ethanol and 
resuspended with the above-mentioned buffer. Six to 10 nig of the material is 

25 then applied to a monoQ 10/10 column (Pharmacia) equilibrated in the same 

buffer. The porin is eluted from a shallow 0.1 to 0.6 M NaCl gradient with a 
1.2% increase per min over a 50 min period. The Flow rate is 1 ml/min. The 
peak containing porin is collected and dialyzed against TEN buffer and 0.05% 
3,14-zwittergen. The porin may be purified further by another S-300 

30 chromatography . 
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Example 6. Purification and chemical modification of the 
polysaccharides 

The capsular polysaccharide from both group B Neisseria meningitidis 
and E. coli Kl consists of a(2-8) polysialic acid (commonly referred to as 
GBMP or Kl polysaccharide). High molecular weight polysaccharide isolated 
from growth medium by precipitation (see, Frasch, C.E., "Production and Control 
of Neisseria meningitidis Vaccines" in Bacterial Vaccines, Alan R. Liss, Inc., 
pages 123-145 (1990)) was purified and chemically modified before being 
coupled to the porin protein. The high molecular weight polysaccharide was 
partially depolymerized with 0.1 M acetic acid (7 mg polysaccharide/ml), pH = 
6.0 to 6.5 (70°C, 3 hrs) to provide polysaccharide having an average molecular 
weight of 12,000-16,000. After purification by gel filtration column 
chromatography (Superdex 200 prep grade, Pharmacia), the polysaccharide was 
N-deacetylated in the presence of NaBH 4 and then N-propionylated as described 
by Jennings et al (J. Immunol. 7J7:1808 (1986)) to afford N-Pr GBMP (see 
Example 14). Treatment with NaI0 4 followed by gel filtration column 
purification gave the oxidized N-Pr GBMP having an average molecular weight 
of 12,000 daltons. 

Example 7. Coupling of oxidized N-Pr GBMP to the group B 
meningococcal class 3 porin protein (PP) 

The oxidized N-Pr GBMP (9.5 mg) was added to purified class 3 porin 
protein (3.4 mg) dissolved in 0.21 ml of 0.2 M phosphate buffer pH 7.5 which 
also contained 10% octyl glucoside. After the polysaccharide was dissolved, 
sodium cyanoborohydride (7 mg) was added and the reaction solution was 
incubated at 37°C for 4 days. The reaction mixture was diluted with 0.15 M 
sodium chloride solution containing 0.01% thimerosal and separated by gel 
filtration column chromatography using Superdex 200 PG. The conjugate (N-Pr 
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GBMP-PP) was obtained as single peak eluting near the void volume. Analysis 
of the conjugate solution for sialic acid and protein showed that the conjugate 
consists of 43% polysaccharide by weight. The porin protein was recovered in 
the conjugate in 44% yield and the polysaccharide in 12% yield. The protein 
recoveries in different experiments generally occur in the 50-80% range and those 
of the polysaccharide in the 9-13% range (see also Example 14). 

Example 8. Immunogenicity studies 

The immunogenicities of the N-Pr GBMP-PP conjugate and those of the 
N-Pr GBMP-Tetanus toxoid (N-Pr GBMP-TT) conjugate which was prepared by 
a similar coupling procedure were assayed in 4-6 week old outbread Swiss 
Webster CFW female mice. The polysaccharide (2 fig)-conjugate was 
administered on days 1, 14 and 28, and the sera collected on day 38. The 
conjugates were administered as saline solutions, adsorbed on aluminum 
hydroxide, or admixed with stearyl tyrosine. The sera EL1SA titers against the 
polysaccharide antigen and bactericidal titers against N. meningitidis group B are 
summarized in Table 1 . 

Example 9. Expression of group B Neisseria meningitidis Outer 
Membrane (MB 3) Using Yeast Pick i a pastor is 
Expression System 

Materials and Methods 
Strains and Plasmids 

Pichia pastoris GS 115 (provided by Invitrogen) has a defect in the 
histidinol dehydrogenase gene (his4) which prevents it from synthesizing 
histidine. All expression plasmids carry the HIS4 gene which complements his4 
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in the host, so transformants are selected for their ability to grow on histidine- 
deficient medium. Until transformed, GS 1 1 5 will not grow on minimal medium 
alone. 

Expression vectors 

5 Four different expression vectors were used that include the strong, 

highly-inducible AOX1 promoter for expression of foreign protein (Pichia 
Expression Kit, Invitrogen). One vector, pHIL-D2, is used for intracellular 
expression, while the other three (pHIL-Sl, pPIC9, and pPIC9K) are used for 
secreted expression. Maps of the pHIL-D2, pHIL-Sl , and pPIC9 vectors may be 

10 found on pp. 19-22 of the Invitrogen Instruction Manual for the Pichia 

Expression Kit, Version E, the contents of which is hereby incorporated by 
reference. Secretion requires the presence of a signal sequence on the expressed 
protein to target it to the secretory pathway. To improve the chances for success, 
two different kinds of vectors are included in the kit. The vector pHIL-Sl carries 

15 a native Pichia pastoris signal from the acid phosphatase gene. PHOl. The 

vectors, pPIC9 and pPIC9K (with corrected HIS4 region), both carry the secretion 
signal from the 5. cerevisiae a-mating factor pre-pro peptide. The advantage of 
expressing secreted proteins is that P. pastoris secretes very low levels of native 
proteins. Thus, the secreted heterologous protein comprises the vast majority of 

20 the total protein in the media and serves as the first step in purification of the 

protein (Barr et aL, Pharm. Eng. I2(2):4%-5\ (1992)). 

Cloning of the meningococcal B class 3 protein gene (MB3) 

The genomic DNA of Group B Neisseria meningitidis (strain 8765) 
served as the template for the amplification of class 3 porin (MB3) in a standard 
25 PCR. The amplified DNA fragment (930 b.p. long) of the mature porin protein 

was ligated in Nde I - BamH I cloning sites of the pET-24a cloning/expression 
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vector, originally constructed by Studier etal, J. Mol Biol 189:1 13-130 (1986); 
Meth. Enzymol 755:60-89(1990); J, Mol Biol 219:37-44 (1991), and 
manufactured by Novagen. The pET vectors were developed for cloning and for 
expressing target DNA fragments under the strong T7 transcription and 
5 translation signals. Expression from the T7 promoter is induced by providing the 

host cell with a source of T7 RNA polymerase. Newer, more convenient vectors 
utilizing the T7 expression system are now available from Novagen (Madison, 
WI 5371 1). The T7 expression system was successfully used for the expression 
of MB3 in E. coli (see Example 3). 

10 The optimization of the translation elongation rate for the expressed MB3 gene 

Codon usage is known to affect the translational elongation rate, and 
therefore it has been considered an important factor in affecting product yields 
(Romanos et al, Yeast 5:423-488 (1992)). There is evidence that codon usage 
may affect both yield and quality of the expressed protein. A number of highly 

15 expressed genes show a strong bias toward a subset of codons (Bennetzen et al, 

J. Biol Chem. 257:3026-3031 (1982). This "major codon bias," which can vary 
greatly between organisms, is thought to be a growth optimization strategy. This 
mechanism allows an organism to be capable of efficient translation of highly 
expressed genes during rapid growth, as only a subset of tRNAs and aminoacyl- 

20 tRNA synthetases need to be present in high concentrations. Kurland et al., TIBS 

72:126-128 (1987). In cases where mRNA contains rare codons, aminoacyl- 
tRNAs may become limited, increasing the probability of amino acid 
misincorporations, and possibly causing ribosomes to drop off. Indeed, a high 
misincorporation frequency has recently been observed in a foreign protein 

25 produced in E. coli (Scorer et al, Nucleic Acids Res. 79.351 1-3516 (1991)). 

Moreover, proteins containing amino acid misincorporations are difficult to 
purify and may have both impaired activity and antigenicity. The presence of 
several rare codons has been shown to limit the production of tetanus toxin 
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fragment C in E. coli (Makoff et al. Nucleic Acids Res. 1 7: 1 01 91 - 1 020 ] (1 989)). 
In yeast, Hoekema et al. (Mol. Cell Biol. 7: 2914-2924 (1987)) showed that 
substitution of a large proportion of preferred codons for rare codons in the 5' 
portion of the PGK (phosphoglycerate kinase) gene caused a decrease in 
expression levels. Recently, the expression of an immunoglobulin kappa chain 
in yeast has been shown to be increased 50-fold when a synthetic codon- 
optimized gene is used, although the level of kappa chain mRNA remains the 
same. 

Significant differences between codon usage profiles of Pichia and MB3 
were found (Table 5). In order to optimize the translation efficiency, particularly 
at the beginning of translation elongation, codons optimal for Pichia were 
introduced into the 5' region of the MB3 gene. When constructing the linker 
used to clone MB3 into pHIL-SK the oligomers were synthesized so that they 
contained sequence optimized for Pichia expression. A 51 nucleotide long 
oligomer (51-mer) was synthesized for this purpose. The sequence of the 
oligomer is: 

5'-tcgagacgtcactttgtacggtactattaaggctggtgttgaga 
cttcccg-3' 

A 47 nucleotide oligomer complementary to the 51-mcr was also synthesized. 
The sequence of this oligomer is: 

S'-CGGGAAGTCTCAACACCAGCCTTAATAGTACCGTACAAAGTGAC 
GTC-3' 

These two oligomers, which contain Xhol and Bsr\ restriction sites, were 
annealed to serve as a connector, and then ligated to vector pHIL-S 1 , which had 
been linearized with Xho\ digestion. The ligated fragment was then digested with 
BamHl, gel purified, and ligated with an MB3 fragment obtained from cutting the 
pNV15 vector with both Bsrl and Bam HI enzymes. The fragment was then 
cloned into the Pichia pHlL-Sl expression vector. The new DNA sequence of 
the 5' region of MB3 was verified by DNA sequencing of pHIL-Sl/MB3 isolated 
from Pichia. 
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The sequence of the original 5' end of the gene for mature MB3 (from NT 

I) is: 

gac gtt acc ctg tac ggc acc att aaa gcc ggc gta gaa act tec cgc tct gta ttt cac cag aac ggc 
DVTL YG T I K A GV ETS RSVF H 0 N G 

5 caa gtt act gaa gtt aca 

0 V T E V T 

The codon-optimized sequence of the same fragment (replaced nucleotides 
showed as capital letters), along with its corresponding amino acid sequence is: 

gac gtC acT Ttg tac ggT acT att aaG gcT ggT gtT gaG act tec cgc tct gta ttt cac cag aac 
10 DVTLYGTIKAGV ETSRSVFHQN 

ggc caa gtt act gaa gtt aca 
G Q V T E V T 

Vector pHIL-Sl/MB3, containing the codon-optimized MB3 DNA. 
served as the template for the amplification of MB3 in a standard PCR. 

15 Oligomers were synthesized to serve as PCR primers. The PCR fragments of 

MB3 were inserted into Pichia expression vectors either directly or by using the 
Original TA Cloning Kit (Invitrogen); details are given below. 

For the cloning of MB3 into the EcoRl site of pHIL-D2: 
Forward primer (39 nt. having an engineered EcoR] site and a sequence (5'ATG) 

20 encoding an initiation methionine): 

5 , -CGAGAATTCATGGACGTCACTTTGTACGGTACTATFAAG-3' 
Reverse primer (45 nt, having an engineered EcoRl site and stop codon): 
5 I -GCTGAATTCTTAGAATTTG'rGGCGCAGACCGACACCGCCGGCAGr-3' 
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For the cloning of MB3 into the EcoRhAvrll sites of pPIC9 and pPIC9: 
Forward primer (39 nucleotides (nt), having an engineered EcoRl site; no 
sequence encoding an initiation methionine was necessary because the leader 
peptide had an initiation methionine): 

5'-AGCGAATTCGACGTCACTTTGTACGGl ACTA1TAAGGCT-3' 
Reverse primer (36 nt, having an engineered AvrW site and stop codon): 
5'-CACCCTAGGTTAGAATTTGTGACGCAGACCGACACC-3' 

For PCR amplification of the complete MB3 gene, Vent® DNA 
polymerase (NEB) was used. The fidelity of this polymerase is 5- 1 5-fold higher 
than that observed for Taq DNA polymerase. To generate an expression cassette 
plasmid, PCR fragments of MB3 (full length and truncated fragments) were 
inserted in Pichia expression vectors either directly or using the Original TA 
Cloning® Kit (Invitrogen), which includes a pCR™II vector for subcloning of 
PCR fragments. Direct cloning of DNA amplified by either Vent® DNA 
polymerase or Pfu DNA polymerase into the vector pCR™II is difficult, as the 
cloning efficiency is often very low. This is due to the 3' to 5' exonuclease 
proofreading activity of Vent® and Pfu, which removes the 3' A overhangs that 
are necessary for TA cloning, leaving blunt ends. The Original TA Cloning® Kit 
allows these blunt-ended fragments to be cloned. Use of this method eliminates 
any enzymatic modifications of the PCR product, and does not require the use of 
PCR primers containing restriction sites. To increase the cloning efficiency 
further, the Invitrogen protocol was modified as follows. Following 
amplification with Vent* or Pfu (see manual for The Original TA Cloning*' Kit, 
protocol for the addition of 3' A-overhangs post amplification, p. 1 9), rather than 
placing the vial on ice, as recommended in the kit, the mineral oil in the PCR 
mixture was immediately removed using Parafilm™. This was accomplished by 
pouring the PCR mixture onto the Parafilm, and zigzagging the drop down the 
surface of the Parafilm with a gentle rocking motion until all of the oil had 
adhered to the Parafilm surface. The reaction mixture, now free of oil. was then 
collected into a fresh tube. The Invitrogen protocol was then resumed with the 
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addition of Taq polymerase. This method allowed the difficult cloning of PCR 
fragments into large expression vectors. 

The expression cassette of the integrating vector (Invitrogen) contains the 
methanol-induced AOX1 promoter and its terminator, flanked by stretches of 
nucleotides up- and downstream from the AOX 1 gene. The P. pastohs His4 gene 
served as an auxotrophic marker. These vectors do not contain a yeast ori 9 hence 
His + colonies must correspond to integration of the expression cassette. All PCR 
fragments of MB3 were inserted in frame with a Pichia Kozak consensus 
sequence (CAAAAAACAA) (Cavenor et al Nucleic Acids Res. 19:3 1 85-3 192 
(1991); Kozak Nucleic Acids Res. 75:8125-8148 (1987); Kozak Proc. Natl Acad. 
Sci. USA 57:8301-8305 (1990)) to provide the best translation initiation of the 
MB3 gene. All inserts were placed under the control of the AOX1 promoter to 
driv^e expression of the gene of interest. After the ligation of the MB3 fragment 
in an appropriate expression vector, chemically competent E. coli cells were 
transformed (TOP 10F') (¥' {proAB, laql q , /acZAM15, Tn/tf (Tet R )} mcrA, 
A(mrr-hsdRMS-mcrBC\ 4>80 /acZAM15, A/acX74, deoR, recA\, araD\39, 
A(ara-leu)7697, gaRJ, galK 9 rpsL(Str R ), end A 1 , nupGX). Other strains which 
may be suitable are DH5a F\ JM 1 09, or any other strain that carries a selectable 
F' episome and is recA deficient (endA is preferable) (Pichia Expression Kit 
Instruction Manual, Invitrogen). Colonies with an MB3 insert were used for the 
preparation of CsCl purified maxi-prep of a plasmid DNA for Pichia 
transformation (Sambrook, J. er al, Eds., Molecular Cloning: A Laboratory 
Manual. 2nd. Ed., Cold Spring Harbor Press (1989), pp. 1.42-1.43). Restriction 
analysis and DNA sequencing (DNA Sequencing Kit, Version 2 (USB)) 
confirmed that these constructs were correct. 

Modification of the starting MB3 sequence was especially useful for 
intracellular expression of the porin gene (pHIL-D2/MB3 construct). Because the 
other constructs (pHIL-Sl/MB3 and pPIC9/MB3) used for MB3 secretion 
contained codons optimal for Pichia in the leader peptide sequence upstream of 
the MB3 insert, the initiation of translation was not rate-limiting. In contrast, the 
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pHIL-D2 vector does not include any leader sequence and the initiation of 
translation must be started from the rare codons of the MB3 insert. The 
optimization of this sequence is believed to be responsible for the fact that pHIL- 
D2/MB3 constructs gave the highest level of MB3 expression of any of the 
clones tested (Tables 3, 4). 

Transformation of yeast cells and DNA analysis of integrants 

Plasmid DNA was linearized with single or double (for higher integration 
efficiencies) digestion, and P. pastoris strain GS1 1 5 (his4 ) was transformed to 
the His + phenotype by the spheroplast method using Zymolyase followed by 
adsorption of transforming DNA and penetration of this DNA through the 
spheroplast pores into the Pichia cells in the presence of PEG and Ca* 2 {Pichia 
Expression Kit manual Invitrogen, pp.33-38). By replica plating or patching on 
Minimal Dextrose (MD: 1.34% yeast nitrogen base (YNB - Difco), 4x1 0 5 % 
biotin, 2% dextrose) versus Minimal Methanol (MM: 1.34% YNB, 4xl0' 5 % 
biotin, 0.5% methanol), it was possible to determine which His + transformants 
also exhibited disruption of the AOXI gene. Transformed spheroplasts were 
seeded on agarose-containing plates using selective growth medium without 
histidine (MD). At the end of 4-6 days, white separated colonies of yeast 
transformants had appeared. These colonies were picked up and were seeded on 
selective methanol-containing medium (MM) for screening of AOXI -disrupted 
(Mut s or Muf ) transformants (Pichia Expression Kit manual, Invitrogen, p. 60). 

Growth of the yeast and methanol induction 

Because recombination events can occur in many different ways which 
affect the level of protein expression (clonal variation), at least 16 verified 
recombinant clones were screened to determine the level of MB3 expression. 
These colonies were grown in 5 ml of glycerol-containing Buffered Glycerol- 
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complex Medium (BMGY: 1% yeast extract, 2% peptone, 100 mM potassium 
phosphate, pH 6.0, 1.34% YNB, 4x1 0 5 % biotin, 1.0% glycerol) (Pichia 
Expression Kit manual, Invitrogen, p. 61) at 30 °C in 50 ml 2098 Bluemax tubes 
(Falcon) in an Innova incubator shaker (New Brunswick Sci.) ("pilot" 
expression). After 1-2 days when cultures had reached an OD600 = 5-10, the 
cells were harvested by centrifugation (4000 rpm for 10 minutes at room 
temperature) and were resuspended in methanol-containing Buffered Methanol- 
complex Medium (BMMY: 1% yeast extract, 2% peptone, 100 mM potassium 
phosphate, pH 6.0, 1.34% YNB, 4x10*'% biotin, 0.5% methanol) {Pichia 
Expression Kit manual, Invitrogen, p. 61) for the induction of the A OX] 
promoter. To replenish exhausted methanol, 0.5% of fresh methanol was added 
each day to induced cells. Aliquots of the cells were collected every day for 6 
days by centrifugation, and stored (pellets and supernatants separately) at -70 °C 
before examining. The most promising clones were examined for the 
optimization of protein expression and to scale-up the expression protocol to 
produce more protein. 

Lysis of P. pastoris cells, analysis by SDS-PAGE and Western blot analysis 

Cells were broken by agitation in breaking buffer (50 mM sodium 
phosphate, pH 7.4; 1 mM PMSF(phenylmethyIsulfonyl fluoride), 1 mM EDTA 
and 5% glycerol). Equal volumes of acid-washed glass beads (0.5 mm in 
diameter) were added. The mixture was vortexed for a total of 4 min, 30 sec 
mixing each, followed by 30 sec on ice. The soluble fraction was recovered by 
centrifugation for 10 min at 14000 rpm at 4°C. Supernatant (or cell lysate, or 
fraction of "soluble" proteins) was removed and stored at -70°C, and the residual 
cell pellet was extracted by vortexing with SDS sample buffer (1% SDS, 5% 
beta-mercaptoethanol, 10% glycerol, 10 mM EDTA, 0.025% bromophenol blue) 
followed by boiling for 10 min. Lysates were centrifuged again and the aqueous 
layer was examined as fraction of "insoluble" or membrane associated proteins. 
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NOVEX pre-cast 8-16% gradient gels were used for separation of proteins 
according to the procedure of Laemmli (Nature 227:680-685 (1970)). Proteins 
separated by sodium dodecyl sulfate-polyacrylamide gel electrophoresis 
(SDS-PAGE) were stained with Coomassie Brilliant Blue R250, or were 
transferred to polyvinylidene difluoride (PVDF) membrane using a Transblott 
apparatus (BioRad Laboratories) according to the company specification. 

The Western blot procedure was carried out without detergents, using 
only blocking procedures, as described by Sheng and Schuster (Bio Technique 
13: 704-708 (1992)) with some modifications. This method provides high 
specificity and sensitivity with a low background. For the transfer, both Western 
transfer membrane and the SDS-PAGE separating gel were equilibrated with 
transfer buffer (24mM Tris-HCl/192 mM glycine/ 20% methanol) for 20 minutes 
prior to electrotransfer. The transfer was performed at 90V and 4°C for 3-4 
hours. Transfer of proteins to PVDF membranes was monitored by the transfer 
of prestained molecular weight markers (BRL). 

Immunostaining of proteins was carried out as follows. The transfer 
membrane was rinsed with TBS (lOmM Tris-HCl/.09% NaCl, pH 7.2). The 
membrane was then incubated in 1% non fat dried milk PBS solution (M-PBS) 
with .02% sodium azide at 37°C for 3 hours (or at 4°C overnight). The 
membrane was then washed 3 times with TBS/0.5% BSA (BSA/TBS) and once 
with TBS. The membrane was then incubated with the primary mouse anti-MB3 
antibody (mouse polyclonal antisera against purified OMP class 3) diluted to 
about 1 :4000 in PBS/1 %BSA (BSA/PBS). and the membrane was again washed 
3 times with TBS/0.5% BSA (BSA/TBS) and once with TBS. The membrane 
was then incubated in 1% M-PBS at room temperature for 30 minutes with gentle 
shaking. The membrane was washed 3 times with TBS/0.5% BSA (BSA/TBS) 
and once with TBS. The membrane was then incubated in the secondary alkaline 
phosphatase-conjugated anti-mouse antibody (Kirkegaard & Perry Laboratory 
(KPL), Gaithersburg, MD) diluted 1 :4000 in 1% BSA/PBS. The membrane was 
then washed 2 times with 0.5% BSA/TBS and 3 limes with .25% Tween 20 in 
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PBS. These washing steps differed from those recommended by Sheng and 
Schuster; the improved protocol provided less background than did the wash 
steps of the reference, which utilized 6 washes in 0.5% BSA/PBS. The 
membrane was then incubated in alkaline phosphatase buffer (0,05% M Tris-HCl, 
pH 9.5; 10 mM MgCl 2 ), followed by incubation in BCIP/NBT substrate solution 
(KPL). The development was stopped by washing the membrane in PBS/50 mM 
EDTA. The limit of detection was about 2-5 ng of native MBS protein. 



Results and discussion 



The strategy used to insert the cDNA encoding the mature MB3 into 
expression vectors and the steps using this construct for the transformation of .P. 
pastoris are outlined below. First, the MB3 gene is cloned into one of the 4 
Pichia expression vectors. In the next step, the resulting construct is linearized 
by digestion with Not\ or Bg /II, and his4 Pichia spheroplasts are transformed 
with the linearized construct. In the following step, a recombination event 
occurs in vivo between the 5 ' and 3 AOXI sequences in the vector and in the 
genome, resulting in replacement of the AOXI gene with the MB3 gene. Next, 
the Pichia transformants are selected on histidine-deficient medium, on which 
only cells that have undergone gene replacement can grow. The one-step gene 
replacement method described for S. cerevisiae (Rothstein, Me/h. Enzymol. 
707:202-21 1 (1983)) was successfully used by Cregg el al. (Biological Research 
on Industrial Yeast, Vol. II, Stewart et al., eds. ,CRC Press, Boca Raton, pp. 1-1 8 
(1987)) for the replacement of the P. pastoris AOX1 structural gene. 
Transformation of GS1 1 5 with 10 /*g of linearized expression vectors (pH!L-D2, 
pHIL-Sl, pPIC9, and pPIC9K) with MB3 insert gave more than 100 colonies in 
each experiment. Thus, the procedure yielded >10 2 His colonies per fu.g DNA, 
which is comparable to that reported for the best results of P. pastoris 
transformations. These transformants have the ability to grow on histidine- 
deficient medium (MD-minimal dextrose), and so are His . About 10-40% of 
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these recombinants were "methanol slow" (Muf - "methanol utilization slow"), 
i.e., demonstrated impaired growth on media such as MM (minimal methanol), 
which contains methanol as the sole carbon and energy source. These His7Mut s 
transformants are a result of the replacement of the AOX1 structural gene with 
the MB3 expression cassette containing the His' gene via a double crossover 
event. Recombination events may also occur as integration or insertion (single 
crossover events) of the expression cassette into the 5' or 3' AOX1 region, which 
leaves the AOX1 gene intact. Among the His7Mut s clones, 25-35% were 
positive, MB3-expressing transformants (Table 2). The reason that the AOX1- 
deleted transformants grow at all on methanol medium is due to low-level 
expression of alcohol oxidase activity by the AOX2 gene product. Analysis of 
DNA isolated from these "positive" recombinants using PCR with 5' A OX J, 3' 
AOXl, 5' MB3, 3' MB3 and other specific primers, indicated that the AOX1 
structural gene was indeed replaced by the fragment containing the MB3 and 
HIS4 genes. Analysis of the DNA isolated from His7Mur transformants 
indicated that the AOXl structural gene was intact and that the entire vector 
containing His4 DNA had integrated elsewhere. Among 39 AOXl -disrupted 
transformants that expressed MB3, no HisVMut T transformants were found, 
indicating preference for \hzAOXl replacement mode of integration. 

The results of immunoblot analysis of 84 Pichia transformants indicated 
that one may express the MB3 protein using all of the constructed recombinant 
plasmids, pHIL-D2/MB3, pHIL-Sl/MB3, pPIC9/MB3. and P P1C9K/MB3 (Table 
3). Thirty-nine clones were isolated that expressed the MB3 protein. Antigenic 
specificity of expressed MB3 protein was examined and was confirmed by 
Western blot analysis using monoclonal and polyclonal antibodies raised against 
wild type N. meningitidis OMP class 3. These results led to the conclusion that 
all of the expression vectors were correctly constructed, and that the 
transformations of Pichia spheroplasts were properly performed. 

The amount of expressed MB3 was determined by densitometric scanning 
of the Coomassie brilliant blue stained protein bands fractionated by SDS-PAGE 
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using a Model GDS-7500 scanning densitometer (UVP Life Sci.) or Model 
IS- 1 000 densitometer (Alpha Innotech Corp.). Purified OMP class 3 extracted 
wild type of N. meningitidis was used as a standard. Based on the results 
(summarized in Table 3), the level of protein expression was estimated to be 
moderate to high. 

The optimization of the translation elongation rate for the expression of 
the MB3 gene (see Materials and Methods, above) was very useful. The 
modification of the starting MB3 sequence was especially effective for 
intracellular expression of the porin gene (pHIL-D2/MB3 construct). Because 
other constructs (pHIL-Sl/MB3 and pPIC9/MB3, both used for MB3 secretion) 
contained codons optimal for Pichia in the leader peptide sequence upstream of 
the MB3 insert, the initiation of translation of these cassettes was not rate- 
limiting. In contrast, the pHIL-D2/MB3 construct did not include a leader 
sequence, and so without codon optimization, translation would have had to have 
been initiated at rare codons of the MB3 insert. The codon-optimized pHIL- 
D2/MB3 construct, when transformed into Pichia chromosomal DNA, provided 
the highest level of MB3 expression of all the other mentioned MB3 expression 
constructs (Tables 3 and 4). Thus, this modification of the translation start 
sequence of MB3 appears to be responsible for the high yield of expressed protein 
in pHIL-D2/MB3 constructs. 

The level of MB3 expression by the best clones {Pichia transformed with 
the pHIL-D2/MB3 construct) was in the range of 0.1-0.6 g per 1L of cell 
suspension, or 1-3 mg per g of cell pellet (Table 3, Fig. 12). Such efficiency of 
expression in yeast has been reported for many of the following manufactured 
proteins: hepatitis B surface antigen (0.3 g/L), superoxide dismutase (0.75 g/L). 
bovine and human lysozyme (0.3 and 0.7 g/L, respectively), human and mouse 
epidermal growth factors (0.5 and 0.45 g/L respectively), human insulin-like 
growth factor (0.5 g/L), human interleukin-2 (1.0 g/L), aprotinin analog (0.8 g/L), 
Kunitz protease inhibitor (1 .0 g/L), etc. (Cregg et al. t Biotechnology, 1 7:903-906, 
Table 1 (1993)). 



It should be emphasized that all of the previously listed levels of 
expression for manufactured proteins are the result of production of these proteins 
during fermentation in high cell density fermentors. MB3 was expressed utilizing 
only shake flask cultures which, as a rule, provide much lower expression levels 
than does fermentation. Recently reported observations lead one to expect a 
much higher yield (a 5-10 fold or greater increase) of MB3 in a fermenter (Cregg 
et al , 1 993). P. pastohs adapts w ell to being scaled up from shake flask to high 
density fermentor cultures. In addition, where ^OA'-deleted Pichia strains are 
used for fermentation, production of foreign proteins can be optimized by first 
causing rapid growth, and then adding methanol to induce protein production 
while minimizing additional cell growth. The long amount of time needed to 
produce proteins when Pichia is growing on methanol can be reduced by 
applying one of several mixed-feed fermentation strategies (Siegel et al., 
Biotechnoi Bioeng. 34:403-404 (1989); Brierley et al, Int. Patent Application 
No. WO 90/03431 (1989); Brierly et al, Biochem. Eng. 559:350-362 (1990); 
Siegel et al.. Int. Patent Application No. WO 90/10697 (1990)). 

Another promising aspect of the expression levels of MB3 protein in 
Pichia is that the results were similar for all examined clones. As other 
investigators have found that in shake flask induction the level of expression is 
proportional to the number of copies of inserted gene of interest (Clare et al, 
1991), it can be deduced that all of the MB3 clones tested were single-copy 
chromosomal integrants, and thus that no Pichia recombinants with multiple 
integrated copies of the MB3 fragment were isolated. 

An important factor in obtaining high levels of expression using 
P. pastoris is the ability to obtain recombinants with multicopy transplacement 
or integration (Romanos et al. Vaccine 9:901-906 (1991); Clare et al, 
Bio/Technology 9:455-460 (1991); Clare et al, Gene 705:205-121 (1991)). . 
Multicopy transformants have been found to be surprisingly stable over multiple 
generations during growth and induction in high cell density fermentations. 
Since this multiple gene insertion event occurs at a low frequency during 
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spheroplast transformation, a special dot blot screening of a number of 
recombinants is used (Scorrer et al., Bio/Technology 72:181-184 (1993)). An 
alternative to screening for spontaneous multiple insertion events is to insert 
multiple copies of the gene(s) of interest into Pichia expression vector pA08 1 5, 
which has recently been constructed by Invitrogen for this purpose. 

Before attempting to express MB3, the protein was evaluated to determine 
if any of the factors believed to reduce expression levels were present. One of the 
factors which can reduce expected high-level accumulation of a protein is 
proteolytic stability. It is now known that highly expressed proteins are devoid 
of good PEST sequences. PEST sequences contain proline (P), glutamic acid (E), 
serine (S) and threonine (T), and are found in all rapidly degraded eukaryotic 
proteins of known sequence; such proteins have been implicated as favored 
substrates for calcium-activated proteases (Rogers et al^ Science 25-/:364-369 

(1986) ). Proteins that are expressed at high levels in yeast do not contain a so- 
called "good" PEST sequence (having a score >5 as calculated by the algorithm 
developed by Rogers et al. (1986)), which leads to susceptibility to proteolysis, 
nor do they contain the pentapeptide sequences XFXRQ or QRXFX (X=any 
amino acid), which are selective for degradation of cytoplasmic proteins by the 
lysosomal pathway (Dice, J.F., Fed. Am.Soc. Exp. Biol. (FASEB) J. /:349-357 

(1987) ). Proteins that are expressed at high levels in yeast do not contain these 
pentapeptide sequences. Computer analysis of the MB3 sequence identified a 
"poor" but not "good" PEST region (13-32aa) having the sequence 
(ETSRSVFHQNGQVTEVTTAT. According Rogers et al (1986) such a poor 
PEST sequence weakly influences the proteolytic stability of eukaryotic proteins. 
Thus, one of the factors which leads to proteolysis is not present in MB3. 

MB3 also does not contain the highly conserved pentapeptide sequences 
mentioned above. The sequence RO SFI (75-79aa) is present in MB3: this 
sequence displays some homology to the degradation pentapeptide QRXFX. but 
is not believed to greatly destabilize MB3. 
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The nature of the NH r terminal amino acid residue can also be an 
important factor in the susceptibility of a protein to degradation. Varshavsky et 
ah have demonstrated that the presence of certain amino acids at the NH r 
terminus provide a stabilizing effect against rapid degradation by ubiquitin- 
mediated pathways (the N-end rule pathway) (Varshavsky et al. Yeasi Genetic 
Engineering, Butterworths, pp. 109-143 (1989)). Most proteins that are 
expressed at high levels in yeast have a stabilizing amino-terminus amino acid 
residue (A, C, G, M, S ? T or V). Examples of such proteins include human 
superoxide disrnutase, human tumor necrosis factor, phosphoglycerate kinase 
from £ cerevisiae, invertase from S. cerevisiae, alcohol oxidase from P. pas tor is, 
and extracellular alkaline protease from Y. lipolytica (Sreekrishna et ah, 
Biochemistry 2<S:41 1 7-4125 (1989)). Although MB3 is expressed well in yeast, 
the NH r terminal aspartic acid (D) of MB3 does not provide a stabilizing effect 
against rapid degradation by ubiquitin-mediated pathways. 

It is possible that the NH r terminal aspartic acid of MB3 will play a role 
in the level of MB3 produced from Pichia in large scale production. Replacing 
the first amino acid of MB3 with one of the amino acids known to stabilize the 
NH 2 -terminus of proteins, mentioned above, could improve the level of MB3 
production. 

It was decided to proceed with experiments attempting to express MB3 
in yeast, as most of the factors known to reduce expression levels were not 
present in MB3. 

The best expression of MB 3 was provided by Pichia clones transformed 
with the pHIL-D2/MB3 expression cassette (Tables 3 and 4). This pHIL-D2 
vector generated intracellular expression of complete, monomeric, non-fusion, 
non-secreted MB3 with an expected MW of about 34 kDa. These clones 
provided the highest level of expression of MB3, up to 600 mg/L or 3 mg per g 
of wet cell pellet (Table 4). About 90-95% of this product was insoluble, 
membrane-associated material, i.e., material which sediments upon centri- 
fugation for 5 min at 10,000g, and that can be extracted by treatment with SDS- 
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containing buffer (PAGE sample buffer) followed by boiling. The protein can 
then be renatured to a conformation that can be easily recognized by an anti- 
meningococcal OMP class 3 antibody. 

Induction of pHIL-D2/MB3 constructed clones with methanol resulted 
in the rapid expression and fast accumulation of intracellular MB3. After 24 
hours of a methanol induction, the level of expressed MB3 was estimated at not 
less than 80% of maximal, which was reached after 5-6 days. 

The pHIL-D2/MB3-containing Pichia recombinant is the most promising 
for commercial production. This clone provides relatively high levels of 
expression which could be significantly improved by using multiple-copy 
recombinants, and by producing the protein in a fermentor. The fact that MB3 
is rapidly produced also provides an advantage for large scale manufacture. 

MB3 expressed in an intracellular form was purified by a 
denaturation/renaturation protocol, followed by gel filtration and ion exchange 
chromatography. The resultant purified protein exhibits an elution profile on size 
exclusion chromatography that resembles the recombinant class 3 protein 
overexpressed in E. coli. MB3 expressed by either E. coli or P. pasiohs co-elutes 
with the native wild-type counterpart, indicating that MB3 expressed by either 
E. coli or P. pasiohs refolds and oligomerizes, achieving full native conformation 
(Figs. 14Aand 14B). 

Both the native {Pichia) secretion signal (PHOl) and the alpha-factor 
signal sequence from S. cerevisiae were tested for targeting expressed porin to 
the secretary pathway. Unexpectedly, the shorter PHOl leader was more 
effective for causing MB3 secretion. The pHlL-Sl Pichia transfer vector 
includes a sequence encoding the 2.5 kDa PHOl leader peptide, a secretion signal 
peptide of P. pastoris. In the pHIL-Sl/MB3 construct, the sequence encoding 
MB3 was inserted downstream of the PHOl leader sequence. 40-50% of the 
36.5 kDa expressed fusion protein PH01/MB3 produced by pHIL-Sl/MB3 
clones was properly cleaved to generate a 34 kDa MB3 monomer (Tables 2 and 
3), and 5-10% of expressed soluble porin was secreted. The pPIC9 and pPIC9K 
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Pichia transfer vectors include a sequence encoding the 10 kDa alpha-factor 
leader derived from S. cerevisiae. Pichia clones transformed by pPIC9/MB3 or 
pPIC9K/MB3 did not secrete porin. These recombinants expressed a 44 kDa 
alpha- factor prepro/MB3 fusion protein well, but no evidence of correct 
cleavage and processing was observed. Improved secretion of expressed MB3 
was not obtained by using its 3 ; truncated fragment fused with either PHOl 
leader or alpha-factor leader peptides. 

Example 10. Isolation, purification and characterization of 

MB3 protein expressed as a secretory protein 

Yeast cells cultures harboring the expression vector containing the gene 
for MB3 (pHIL-S 1 -pN V3 1 8) were configured to isolate the protein as soluble 
secreted material). The supernatant was clarified by precipitation with 20% 
ethanol (v/v) to remove contaminating yeast culture impurities. The supernatant 
was then precipitated with 80% ethanol (v/v). The resulting pellet was washed 
with TEN buffer (Tris HC1, pH 8.0, 100 mM NaCl and 1 mM EDTA), in order 
to remove other hydrosoluble contaminating secreted proteins. The pellet 
containing MB3 was dissolved in an aqueous solution of detergent (solubilizing 
buffer), comprised of TEN buffer with 5% Z 3-14. The solution was applied to 
a Hi-Trap Q Sepharose ion exchange column (1 ml) (Pharmacia) equilibrated in 
50 mM Tris, 0.2 M NaCl and 1 .0 mM EDTA (pH 8.0). A gradient of 0.2-1 .0 M 
NaCl was applied, and MB3 protein eluted as a single peak. 

Example 11. Isolation, purification and characterization of 

MB3 protein expressed as an insoluble- 
membrane bound protein 

Yeast cells cultures harboring the expression vector containing the gene 
for MB3 (pHILD-2-pNV322) (see Table 3) were resuspended in breaking buffer 
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(i.e., 50 mM sodium phosphate buffer, pH 7.4, 1 mM EDTA, and 5% glycerol), 
to a concentration equivalent to 50-100 ODs. The suspension was added to the 
same volume of acid treated glass beads. The suspension was lysed using a 
Minibead-Beater (Biospec Products, Bartlesville, OK), in 8 consecutive cycles 
5 of 1 min each, followed by 1 min on ice, between each cycle. As an alternative 

procedure, the lysis process was facilitated by the addition of Zymolase to the 
breaking buffer. The suspension was transferred to a glass sintered filter to 
separate the glass beads, and the cell suspension was collected in the filtrate. The 
beads were further washed and the filtrates combined. The suspension was then 

10 centrifuged at 12,000 rpm for 1 5 min at 4°C. A series of consecutive washing 

steps was applied to the resultant pellet, consisting of the following: (a) TEN 
(Tris HC1, pH 8.0, 100 mM NaCl, and 1 mM EDTA) containing 0.5% 
deoxycholate; (b) TEN containing 0.1% SDS and 1% Nonidet, after which the 
suspension was rotated for 30 min at 25 °C; (c) washing with TEN buffer; and 

15 (d) washing with TEN buffer containing 5% Z 3-14. under rotation overnight at 

4°C. Each washing step was followed by centrifugation at 12,000 rpm for 10 min 
at 4°C to collect the pellet for the following step. As an alternative method of 
washing the pellet, the suspension was passed through an 1 8 gauge needle in lieu 
of rotation in steps (b) and (d). Finally, the MB3 was extracted with 8M urea, or 

20 6M guanadinium HCI, and the extract was sonicated for 10 min, using a water 

bath sonicator. The extract was clarified by centrifugation (12,000 rpm, for 10 
min at 4°C), the same volume of a 10% aqueous solution of 3,14-zwittergen 
(Calbiochem) was added and the solution thoroughly mixed. The solution was 
again sonicated for 10 min. Any residual material was removed by 

25 centrifugation. This mixture was then applied to a Sephacryl S-300 (5x100 cm) 

column (Pharmacia) equilibrated in a buffer comprised of 0.1 M Tris-HCl, 0.2 M 
NaCl, 10 mM EDTA, 20 mM CaCl 2 and 0.05% Z 3-14 (pH 8.0). Fractions 
containing class 2 protein were identified by SDS-PAGE, pooled, and applied to 
a Hi-Trap Q Sepharose ion exchange column (1 ml) (Pharmacia) equilibrated in 

30 50 mM Tris, 0.2 M NaCl and 1 .0 mM EDTA (pH 8.0). A gradient of 0.2-1 .0 M 
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NaCl was applied and MB3 protein eluted as a single peak. Figures 14A, 14B 
and 15 depict the elution profile of purified MB3 in a Sepharose 12 (Pharmacia ) 
connected to an HPLC (Hewlett Packard, model 1 090). Based on the comparison 
with the native wild-type class 3 protein, as well as calibration using molecular 
weight standards, the elution profile is indicative of trimeric assembly. 

Example 12. Preparation of GAMP- TT Conjugate 

12.1 Preparation of NMA polysaccharide for conjugation. A 

meningitidis group A (NMA) strain 604 A was grown in modified Franz medium 
(Franz, I. D., J. Bact. 73:757-761 (1942). Precipitation of the polysaccharide as 
a cationic detergent complex followed by fractional precipitation with ethanol 
provided the high molecular NMA capsular polysaccharide. The high molecular 
weight polysaccharide was further purified by ultra filtration. Partial hydrolysis 
of the polysaccharide with 100 mM sodium acetate buffer pH 5.0 at 70°C yielded 
a low molecular weight polysaccharide in the range of 10,000-20,000 daltons. 
The free reducing terminal residue of the polysaccharide was reduced with 
NaBH 4 in the cold to preserve O-acetyl substituents and then oxidized with 
sodium periodate to generate terminal aldehyde groups. The oxidized 
polysaccharide was the purified and fractionated by size exclusion 
chromatography to provide activated group A meningococcal polysaccharide 
(GAMP) of average molecular weight about 13,000 daltons. 

12.2 Preparation of GAMP-TT conjugate. Tetanus toxoid (Serum 
Statens Institute, Denmark) was first purified to its monomeric form 
(mw 1 50,000) by size exclusion chromatography using a Superdex G-200 column 
(Pharmacia). Freeze-dried tetanus toxoid monomer (1 part by weight) and 
oxidized GAMP (2.5 part by weight) were dissolved in 0.2 M phosphate buffer 
pH 7.5. Recrystallized NaBH 3 CN (1 part) was added and the reaction mixture 
incubated at 37°C for 4 days. The conjugate was purified from the free 
components by size exclusion chromatography using a Superdex G-200 column 



- 78 - 



(Pharmacia), and PBS containing 0.01% thimerosal as an eluent. Purified 
GAMP-tetanus toxoid conjugate was stored at 4°C in this buffer. The 
polysaccharide content of the conjugate based on phosphorus analysis (Chen 
assay) was about 1 8-20% by weight. 

Example 13. Preparation of GCMP- TT Conjugate 

13 J Preparation of NMC polysaccharide for conjugation. The 

capsular polysaccharide was isolated from the growth medium of Neisseria 
meningitidis group C (NMC) strain C 1 1 . This strain was grown in modified 
Franz medium. The NMC polysaccharide (group C meningococcal 
polysaccharide (GCMP)) was isolated from the culture medium by cetavlon 
precipitation as described for the GAMP. Native GCMP was O-deacety lated with 
base and depolymerized by oxidative cleavage with NaI0 4 to an average 
molecular weight of 10,000-20,000. The cleaved polysaccharide was sized and 
purified by gel filtration chromatography to provide a highly purified product of 
average molecular weight about 12,000 daltons and having aldehyde groups at 
both termini. 

13.2 Preparation of GCMP-TT conjugate. Tetanus toxoid monomer 
(1 part) and solid oxidized GCMP (1 part) were dissolved in 0.2 M phosphate 
buffer pH 7.5 and incubated at 37°C with 1 part of recrystallized NaBH 3 CN for 
4 days. The conjugate was purified from its free components by gel filtration 
chromatography on Superdex G-200 using PBS containing 0.01% thimerosal as 
eluent. The purified conjugate was stored at 4°C prior to being formulated for 
animal studies. The content of the polysaccharide in the conjugate was 33% 
based on its sialic acid content as measured by the Svennerholm resorcinol assay 
(Biochim. Biophys. Acta 244:604-61 1 (1957). 
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Example 14. Preparation of N-Propionyl Group B Meningococcal 

Polysaccharide-rPorB Conjugate 

14.1 Preparation of Neisseria rPorB. Expression of class 3 N. 
meningitidis porin protein (PorB) in E. coli and purification of porin gene 
products is described supra. The recombinant rPorB protein was purified by 
using a sephacryl S-300 molecular sieve column equilibrated with 100 mM Tris- 
HC1, 200 nM NaCl, 10 mM EDTA, 0.05% Zwittergen 3, 14 (Calbiochem. La 
Jolla, CA), 0.02% sodium azide pH 8.0. The protein fractions as measured by 
their OD 280 eluting with an apparent molecular weight of trimers were pooled and 
diafiltered against 0.25 M HEPES, 0.25 M NaCl, 0.05% Zwittergen 3, 14 pH 8.5, 
to a concentration of 10-12 mg/ml. 

14.2 Preparation of N-propionylated Group B Meningococcal 
Polysaccharide (GBMP). The N-propionylated GBMP and its oxidized form 
were prepared as described in U.S. Patent No. 4,727,136 and EPO 0504202 , both 
of which are fully incorporated by reference herein. 

14.3 Preparation of N-Pr-GBMP-rPorB conjugate. To 1 0 mg of 
oxidized N-Pr-GBMP of average molecular weight 12,000 was added 33 |Lil of a 
12 mg/ml of rPorB protein in 0.25 M HEPES, 0.25% M NaCL 0.05% Zwittergen 
3. 14, pH 8.5. The solution was mixed until all solid dissolved and 6.5 mg of 
recry stall ized NaBH 3 CN was added. The solution was incubated at 37°C for 4 
days and the conjugate was purified from the mixture by using a Superdex G-200 
column (Pharmacia) equilibrated with PBS -0.0% thimerosal. Protein fractions 
were combined and stored at 4°C. The conjugates were analyzed for their sialic 
acid content by the resorcinol assay and for protein with the Pierce Coomassie 
Plus assay. The resulting conjugate had a polysaccharide content of about 20- 
25% and is devoid of any pyrogens as measured by the LAL and rabbit 
pyrogenicity tests. 
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Example 75. Analysis of Conjugates by Capillary Electrophoresis 

15.1 System and method Analysis was performed by Capillary Zone 
Electrophoresis on a Beckman 2000 Series CE system (Beckman Instruments 
Inc., Fullerton, CA) using an untreated fused silica capillary of dimensions 47 cm 

5 total length (40 cm effective length) by 50 jim i.d. (375 \xm o.d.) and 0.4N borate 

buffer, pH 10.2 as electrolyte (Hewlett Packard, Palo Alto, CA). System control 
and data acquisition was performed using Beckman Gold system software. The 
voltage was set at 25 KV and the detector was set to 200 nm detection 
wavelength. The capillary temperature was set to 20°C. The capillary was 
10 conditioned between runs with a high pressure rinse for 2.0 minutes with 0.1 M 

sodium hydroxide followed by 2.0 minutes with deionized water. All samples 
were pressure injected. All buffer and sample media were filtered through an 
appropriate 0.2 \im membrane filter and degassed prior to use. 

15.2 Analysis of Conjugates. After purification the conjugates were 
15 concentrated by ultrafiltration through an Amicon Centricon-3 concentrator 

(Amicon, Inc., Beverly, MA). Meningococcal polysaccharide and tetanus toxoid 
monomer calibration samples were prepared in deionized water at a concentration 
of 0.25 mg/ml and 0.28 mg/ml, respectively. The method was determined to be 
selective for the glycoprotein and conjugate components with adjacent 

20 components being completely separated (Rs>1.5), as demonstrated in the 

electropherograms of the polysaccharides and protein spiked glycoprotein 
conjugates (Fig. 20 and Fig. 21). Fig. 20 shows the GAMP-TT conjugate spiked 
with GAMP and TT-monomer conjugate components, while Fig. 21 shows the 
GCMP-TT conjugate spiked with GCMP and TT-monomer conjugate 

25 components. The lower limit of detection (LLD) for the free form polysaccharide 

and protein components for the method was determined to be in the subnanogram 
level. A lower limit of quantitation (LLQ) of approximately 0.6 ng was obtained 
for the free form of each component. A linear response was obtained for the 
selected total mass of each component. A linear response was obtained for the 
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selected total mass range of 0.6-2.6 ng and 0.6-2.4 ng for the polysaccharide and 
protein, respectively, with a coefficient of determination of 0.99 for both curves. 
Using this CZE based assay, analysis of a meningococcal polysaccharide-tetanus 
toxoid conjugate indicated a free polysaccharide content of less than about 5% 
and a free protein content of less than about 2%. 

Example 16. Immunization and Immunoassays 

16.1 Trivalent conjugate vaccine formulation. Each individual 
conjugate component (A, B, C) was absorbed onto Aluminum hydroxide 
(Al(OH) 3 ) Alhydrogel (Superfos, Denmark) at a final Al concentration of 

1 mg/ml of the trivalent vaccine. Three vaccines were formulated in which the 
doses of each conjugated polysaccharide varied. Formulations had either about 

2 ^ig of each A, B, and C conjugated polysaccharide; or about 2 |ag A conjugated 
polysaccharide, about 5 \xg B conjugated polysaccharide and about 2\ig C 
conjugated polysaccharide; or about 5 \xg of each A, B, and C conjugated 
polysaccharide per dose of 0.2 ml of PBS, 0.01% thimerosal. 

16.2 Immunization. Female Balb/c mice (Charles River Laboratories) 
4-6 weeks old, were injected i.p. at days 0, 28. and 42. Bleeds were performed 
at days 0. 14, 28, and 42, and mice were finally exsanguinated at day 52. Sera 
were stored at -70°C prior to serological analysis. 

16.3 Immunoassays: 

ELISAs: Antibody titers to each A, N-propionylated B and C polysaccharides 
were determined by ELISA using the corresponding HSA conjugates as coating 
antigen (Figs. 22, 23, and 24). Antibody titer was defined as the x-axis intercept 
of the linear regression curve of absorbance vs. absorbance x dilution factor. 
Bactericidal Assays: Bactericidal assays were performed using baby rabbit 
serum as a source of complement and N. meningitidis strains H 44/76 (Serotype 
15), Cll and Al respectively used as group B meningococcal, group C 
meningococcal, and group A meningococcal organisms in this assay (Figs. 25, 
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26, and 27). Bactericidal titer was defined as the serum dilution producing 50% 
reduction in viable counts. 

Having now fully described this invention, it will be understood by those 
of ordinary skill in the art that the invention can be practiced within a wide and 
5 equivalent range of conditions, formulations and other parameters without 

affecting the scope of the invention or any embodiment thereof. All patents and 
publications cited herein are fully incorporated by reference herein in their 
entirety. 
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Table 1. ELISA and Bactericidal Titers of Group B Meningococcal 
Conjugate Vaccines (N-Pr GBMP-Protein) 



Vaccine 


Adjuvant 


ELISA 
Titer 


Bactericida 
1 Titer 




Saline 


5,400 


0 


N-Pr GBMP- 


Al(OH) 3 


13.000 


0 


TT 
1 1 


ST 1 


17,000 


0 




CFA 2 


40,000 


800 




Saline 


20,000 


500 




Saline 


22,000 


150 




Saline 


39,000 


960 




Al(OH) 3 


93,000 


200 


N-Pr GBMP-PP 


Al(OH) 3 


166.000 


>3,200 




Al(OH) 3 


130,000 


1,200 




ST 


53,000 


1.000 




ST 


29,000 


1,700 




ST 


72,000 


1.500 




Saline 


>100 


0 


N-Pr GBMP 


Al(OH) 3 


>100 


0 




ST 


>100 


0 




Saline 


>100 


0 


PP 


Al(OH) 3 


>100 


0 




ST 


660 


0 



'ST = Stearyl tyrosine. 

2 CFA = Complete Freund's Adjuvant 
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Table 2. Efficacy of a transformation of yeast (Pichia) cells 



Construct 


Number of analyzed 
transformants 


MB3 expressed transformants 




Number of positive 


"A, from total 


pHIL-D2 / MB3 


32 


9 


28 


pHIL-Sl / MBS 


23 


8 


35 


pPIC9 / MB3 


16 


4 


25 


P PIC9K / MBS 


16 


5 


31 
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Table 3. Expression of MB3 porin protein with recombinant Pichia pastoris 



AMVAX 


Clone 


Vector 


Level of expression 
mg'g mg/L 


Secretion 


pnv 3 1 1 


SI/MB3/3/S 


pHIL-SI 


ND 


20-30 


0 


pnv 312 


S1/MB3/5/S 


pHIL-SI 


ND 


30-40 


0 


pnv 313 


Sl/MB3/7/s 


pHIL-SI 


ND 


30-40 


0 


pnv 314 


S1/MB3/12/S 


pHIL-SI 


ND 


20-30 


5 - 10 


pnv 315 


S1/MB3/15/S 


pHIL-SI 


ND 


20-30 


0 


pnv 316 


S1/MB3/18/S 


pHIL-SI 


ND 


80- 100 


5 - 10 


pnv 3 1 7 


S1/MB3/22/S 


pHIL-SI 


ND 


50-60 


5-10 


pnv 318 


S1/MB3/23/S 


pHIL-SI 


ND 


300 - 400 


5 - 10 


pnv 32 1 


D2/MB3/1-7/S 


pHIL-D2 


2.4 


480 


0 


pnv 322 


D2/MB3/2-1/S 


pHIL-D2 


3.0 


600 


0 


pnv 323 


D2/MB3/2-6/S 


pHIL-D2 


1.7 


340 


0 


pnv 324 


D2/MB3/2-8/S 


pHIL-D2 


1.6 


320 


0 


pnv 325 


D2/MB3/4-I/S 


pH!L-D2 


1.7 


340 


0 


pnv 326 


D2/MB3/4-3/s 


pHIL-D2 


2.4 


480 


0 


pnv 327 


D2/MB3/4-4/S 


pHIL-D2 


2.4 


480 


0 


pnv 328 


D2/MB3/4-5/S 


pHIL-D2 


2.4 


480 


0 


pnv 329 


D2/MB3/4-26/S | 


PH1L-D2 


2.4 


480 


0 


pnv 341 


P9/MB3/1-46/S 


pPIC-9 


ND 


10-20 


0 


pnv 342 


P9/MB3/I-261/S 


pPIC-9 


ND 


80-100 


0 


pnv 343 


P9/MB3/I-263/S 


pPIC-9 


ND 


20 - 30 


0 


pnv 344 


P9/MB3/1-268/S 


pPIC-9 


ND 


20 - 30 


0 


pnv 345 


9K/MB3/Tr/3-4/s 


pPJC-9K 


ND 


150 - 200 


5 


pnv 346 


9K/MB3/Tr/3-5/s 


pPIC-9K 


ND 


100 - 150 


0 


pnv 347 


9K/MB3/Tr/3-6/s 


pPIC-9K 


ND 


100 - 150 


0 


pnv 348 


9K/MB3/Tr/3-8/s 


pPIC-9K 


ND 


80- 100 


0 


pnv 349 


9K/MB3/Tr/3-9/s 


pP!C-9K 


ND 


80- 100 


0 
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Code 
AMVAX 


Clone 


Vector 


Level of expression 
mg/g rng/L 


Secretion 


nnv 350 


9K7MB3/6-1/S 


pPIC-9K 


ND 


150 - 200 


0 


pnv 351 


9K/MB3/6-2/S 


pP!C-9K 


ND 


100-150 


o 


pnv 352 


9K/MB3/6-3/S 


pP!C-9K 


ND 


100-150 


o 


pnv 353 


9K/MB3/6-5/S 


pPIC-9K 


ND 


80-100 


o 


pnv 354 


9K/MB3/6-9/S 


r°!C-9K 


ND 


80 - 100 


o 


pnv 355 




pnv, *7i\ 


ND 


150- 200 


n 


pnv _>_>o 


7 IS./ IVII3J/"-J/b 


n pip Q|/ 


ND 


80 - 100 


A 

u 


pnv 357 


VIS/IVIDj/ I U-ZW/i 


pr l\- -VIS. 


ND 


80 - 100 


n I 

u 


pnv 358 


9K/MB3/I0-33/S 


pPIC-9K 


ND 


80- 100 


0 


pnv 359 


9K/MB3/Tr/1 1- 


pPIC-9K 


ND 


150 - 200 


0 


pnv 360 


9K/MB3/Tr/1 1- 


pPIC-9K 


ND 


150 - 200 


0 


pnv 361 


9K/MB3/Tr/1 1- 


pPIC-9K 


ND 


80- 100 


0 


pnv 362 


9K/MB3/Tr/1 1- 


pPIC-9K 


ND 


80- 100 


0 
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Table 4. The expression of MB3 by recombinant clones with different 
expression cassettes. The main characteristic of the best clones. 



Code: 


pnv318 
sl/MB3/ 
23/s 


pnv322 
D1/MB3/2- 
l/s 


pnv345 
9K/MB3/Tr/3- 
4/s 


pnv350 
9K/MB3/6- 

! i/s 


Characteristic: 
Expression vector 


pHIL-Sl 


pHIL-D2 


pPIC 9K 


pPIC 9K 


Fused leader 
peptide 


PHOl 
(2.5kDa) 


NO 


a-factor(lOkDa) 


a- 

factor(lOKDa) 


Promoter for 
MB3 


AOX1 


AOX1 


AOX1 


AOX1 


Size of expr. 
protein(s) 


34.0; 
37.5kDa 


34.0kDa 


43kDa 


44kDa 


Cleavage 
(Processing) 


Cleavage 
(40-50%) 


NO 


NO 


NO 


Secretion 


Weak, 
<10% 


NO 


NO 


NO 


MB3 degradation 


<10% 


<10% 


<10% 


<10% 


Express 
level(mg/g) 


2.0 


3.0 


2.0 


1.5 


Expression Level 
(mg/L) 


300.0 


600.0 


150.0 


150.0 


Cytosol 
localization 


60-70% 


5-10% 


50% 


50% 


Membrane 
association 


30-40% 


90-95% 


50% 


50% 


Solubility 


Partly 
soluble 


Insoluble 


Partly soluble 


Partly soluble 
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Table 5. Codon Usage for Picltia pastoris and MB3 



Pichia pastoris codon usage 



TTT 


phe 


F 


1 1 


TCT 


ser 


S 


13 


TAT 


tyr 


Y 


6 


TGT 


cys 


C 


5 


TTC 


phe 


F 


5 


TCC 


ser 


s 


9 


TAC 


tyr 


Y 


8 


TGC 


cys 


C 


*) 


TTA 


leu 


L 


3 


"TV* A 

1 LA 


ser 


c 


«■) 


T A A 
I AA 


UL H 


Z 




1 OA 


OPA 


z 




TTG 


leu 


L 


26 


TCG 


ser 


s 


-> 


TAG 


AMB 


z 




TGG 


trp 


w 


-> 

j> 


CCT 


leu 


L 


4 


CCT 


pro 


p 


6 


CAT 


his 


H 




CTG 


arg 


R 


4 


CTC 


leu 


L 


1 


CCC 


pro 


p 


5 


CAC 


his 


H 


3 


CGC 


arg 


R 


2 i 


CTA 


leu 


L 


4 


CCA 


pro 


p 


4 


CAA 


gin 


0 


12 


CGA 


arg 


R 




CTG 


leu 


L 


8 


CCG 


pro 


p 


1 


CAG 


gin 


0 


1 


CGG 


arg 


R 


2 


ATT 


ile 


1 


8 


ACT 


thr 


T 


17 


AAT 


asn 


N 


9 


AGT 


ser 


S 


6 


ATC 


ile 


1 


7 


ACC 


thr 


T 


5 


AAC 


asn 


N 


4 


AGC 


ser 


S 


I 


ATA 


ile 


1 


3 


ACA 


thr 


T 


5 


AAA 


Ivs 


K 


15 


AGA 


arg 


R 


6 


ATG 


ile 


M 


4 


ACG 


thr 


T 


1 


AAG 


lys 


K 


14 


AGG 


arg 


R 


6 


GTT 


val 


V 


15 


GCT 


ala 


A 


17 


GAT 


asp 


D 


15 


GGT 


gly 


G 


13 


GTC 


val 


V 


6 


GCC 


ala 


A 


6 


GAC 


asp 


D 


12 


GGC 


giy 


G 


5 


GTA 


val 


V 


2 


GCA 


ala 


A 


9 


GAA 


glu 


E 


23 


GGA 


gly 


G 


6 


GTG 


val 


V 


!0 


GCG 


ala 


A 


1 


GAG 


glu 


E 


1 1 


GGG 


gly 


G 





Outer membrane group B porin protein class 3 (MB3 ) codon usage 



TTT 


phe 


F 


2 


TCT 


ser 


S 


8 


TAT 


tyr 


Y 


4 


TGT 


cys 


C 




TTC 


phe 


F 


1 1 


TCC 


ser 


S 


7 


TAC 


tyr 


Y 


1 1 


TGC 


cys 


c 




TTA 


leu 


L 


1 


TCA 


ser 


s 




TAA 


OCH 


Z 


1 


TGA 


OPA 


z 




TTG 


leu 


L 


1 1 


TCG 


ser 


s 


4 


TAG 


AMB 


Z 




TGG 


trp 


w 


4 


CCT 


leu 


L 


2 


CCT 


pro 


p 


2 


CAT 


his 


H 


2 


CTG 


arg 


R 


4 


CTC 


leu 


L 




CCC 


pro 


p 


3 


CAC 


his 


H 


7 


CGC 


arg 


R 


8 


CTA 


leu 


L 




CCA 


pro 


p 




CAA 


gin 


0 


10 


CGA 


arg 


K 




CTG 


leu 


L 


7 


CCG 


pro 


p 




CAG 


gin 


0 


4 


CGG 


arg 


R 


1 


ATT 


ile 


I 


5 


ACT 


thr 


T 


5 


AAT 


asn 


N 


6 


AGT 


set 


S 




ATC 


ile 


1 


7 


ACC 


thr 


T 


7 


AAC 


asn 


N 


12 


AGC 


ser 


S 


9 


ATA 


ile 


1 




ACA 


thr 


T 




AAA 


Ivs 


K 


21 


AGA 


arg 


R 


1 


ATG 


met 


M 


2 


ACG 


thr 


T 


1 


AAG 


lys 


K 


-> 


AGG 


arg 


R 




GTT 


val 


V 


10 


GCT 


aia 


A 


4 


GAT 


asp 


D 


9 


GGT 


giy 


G 


1 


GTC 


val 


V 


5 


GCC 


ala 


A 


7 


GAC 


asp 


D 


12 


GGC 


gly 


G 


2 


GTA 


val 


V 


9 


GCA 


ala 


A 


9 


GAA 


glu 


E 


1 1 


GGA 


gly 


G 


I 


GTG 


val 


V 


7 


GCG 


ala 


A 




GAG 


glu 


E 


4 


GGG 


gly 


G 
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What Is Claimed Is: 

1 . A method for the high level expression of the outer membrane 
meningococcal group B porin protein or a fusion protein thereof in yeast, 
comprising: 

(a) ligating into a plasmid having a selectable marker a gene 
coding for a protein selected from the group consisting of: 

(i) a mature porin protein 

(ii) a fusion protein which is a mature porin protein fused 
to a yeast secretion signal peptide; 

wherein said gene is operably linked to a yeast promoter; 

(b) transforming said plasmid containing said gene into a yeast 
strain; 

(c) selecting the transformed yeast by growing said yeast in a 
culture medium allowing selection of said transformed yeast; 

(d) growing the transformed yeast, and 

(e) inducing expression of said protein to give yeast containing 
said protein; 

wherein the protein so expressed comprises more than about 2% of the total 
protein expressed in said yeast. 

2. The method according to claim 1, wherein the protein so 
expressed comprises about 3-5% of the total protein expressed in said yeast. 

3. The method according to claim 1, wherein said mature porin 
protein is the Neisseria meningitidis mature outer membrane class 3 protein from 
serogroup B. 
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4. The method according to claim 1 , wherein said yeast promoter is 
the AOX1 promoter. 

5. The method according to claim 1, wherein said yeast secretion 
signal peptide is selected from the group consisting of the secretion signal of the 

5 S. cerevisiae a-mating factor prepro-peptide and the secretion signal of the P. 

pastoris acid phosphatase gene. 

6. The method according to claim 1 , wherein said plasmid is selected 
from the group consisting of pHIL-D2, pHIL-Sl, pPIC9 and pPIC9K. 

7. The method according to claim 1 , wherein said gene comprises 
10 a nucleotide sequence that incorporates codons optimized for yeast codon usage. 

8. The method according to claim 7, wherein said codons optimized 
for yeast codon usage are in the 5' region of said gene. 

9. The method according to claim 8, wherein said 5' region of said 
gene is the nucleotide sequence: 

15 5'-gac gtC acT Ttg tac ggT acT att aaG gcT ggT gtT gaG act tec cgc tct gta tit cac cag 

aac ggc caa gtt act gaa gtt aca-3 1 . 

10. The method according to claim 8, wherein said yeast is P. 
pastoris. 

1 1 . The method of claim 1 wherein said yeast secretes said protein or 
20 fusion protein into a growth medium. 
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12. The method of claim 1 1 wherein said plasmid is selected from the 
group consisting of pHIL-Sl, pPIC9, and pPIC9K. 

13. A method of purifying the outer membrane meningococcal group 
B porin protein or fusion protein thereof obtained according to the method of 

5 claim 1 comprising: 

(a) lysing said yeast obtained in step (d) to release said protein 
or fusion protein as an insoluble membrane bound fraction; 

(b) washing said insoluble membrane bound fraction obtained 
in step (a) with a buffer to remove contaminating yeast cellular 

10 proteins; 

(c) suspending and dissolving said insoluble membrane bound 
fraction obtained in step (b) in an aqueous solution of a 
denaturant; 

(d) diluting the solution obtained in step (c) with a detergent; 
15 and 

(e) purifying said protein or fusion protein by gel filtration and 
ion exchange chromatography. 

14. A method of purifying the outer membrane meningococcal group 
B porin protein or fusion protein thereof obtained according to the method of 
claim 1 1 comprising: 

(a) centrifuging said yeast culture which has 
expressed the protein to isolate the protein as soluble secreted 
material; 

(b) removing contaminating yeast culture impurities from 
the soluble secreted material obtained in step (a) by precipitating 
said impurities with about 20% ethanol, wherein the soluble 
secreted material remains in the soluble fraction; 



20 



25 
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(c) precipitating the secreted material from the soluble fraction 
of step (b) with about 80% ethanol; 

(d) washing the precipitated material obtained in step (c) with 
a buffer to remove contaminating yeast secreted proteins; 

(e) suspending and dissolving the precipitated material 
obtained in step (d) in an aqueous solution of detergent; and 

(f) purifying the protein by ion exchange chromatography. 

1 5. A yeast host cell that contains a gene coding for a protein selected 
from the group consisting of: 

(a) a mature porin protein 

(b) a fusion protein which is a mature porin protein fused to a 
yeast secretion signal peptide. 

1 6. The yeast host cell of claim 1 5, wherein said yeast contains more 
than one copy of said gene. 

1 7. The yeast host cell of claim 1 5 wherein said mature porin protein 
is the Neisseria meningitidis mature outer membrane class 3 protein from 
serogroup B. 

18. The yeast host cell of claim 17 wherein said plasmid is selected 
from the group consisting of pHIL-D2, pHIL-Sl, pPIC9, pPIC9K and pA0815. 

19. The yeast host cell of claim 15, wherein said yeast is P. pastoris. 

20. The yeast host cell of claim 1 5, wherein the 5 1 region of the gene 
encoding said protein is encoded by the nucleotide sequence: 
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5'-gac gtC acT Ttg tac ggT acT att aaG gcT ggT gtT gaG act tec cgc tct gta ttt cac cag 
aac ggc caa gtt act gaa gtt aca-3'. 

21. A nucleotide sequence coding for an outer membrane 
meningococcal group B porin protein, wherein at least one codon has been 
changed to optimize yeast codon usage. 

22. The nucleotide sequence of claim 21, wherein said porin protein 
is the mature outer membrane class 3 protein from serogroup B, and said codon 
changes are selected from the group of changes consisting of: (GTT to GTC at 
positions 4-6 of the native sequence), (ACC to ACT at positions 7-9 of the native 
sequence), (CTG to TTG at positions 10-12 of the native sequence), (GGC to 
GGT at positions 1 6-1 8 of the native sequence), (ACC to ACT at positions 1 9-2 1 
of the native sequence), (ATC to ATT at positions 22-24 of the native sequence), 
(AAA to AAG at positions 25-27 of the native sequence), (GCC to GCT at 
positions 28-30 of the native sequence), (GGC to GGT at positions 31-33 of the 
native sequence), (GTA to GTT at positions 34-36 of the native sequence), (GAA 
to GAG at positions 37-39 of the native sequence); 

wherein said positions are numbered from the first nucleotide of the native 
nucleotide sequence encoding said protein. 

23. A vaccine comprising group A meningococcal polysaccharide 
(GAMP), group B meningococcal polysaccharide (GBMP), and group C 
meningococcal polysaccharide (GCMP) antigens, together with a 
pharmaceutical^ acceptable carrier. 

24. The vaccine of claim 23, wherein said group A meningococcal 
polysaccharide (GAMP), group B meningococcal polysaccharide (GBMP), and 
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group C meningococcal polysaccharide (GCMP) antigens are each conjugated to 
a protein carrier. 

25. The vaccine of claim 24, wherein said protein carrier to which 
said GBMP antigen is conjugated is class 3 N. meningitidis porin protein (PorB). 

26. The vaccine of claim 24, wherein said protein carrier to which 
said GAMP antigen and said GCMP antigen are conjugated is tetanus toxoid. 

27. The vaccine of claim 25, wherein said GBMP antigen is N- 
propionylated prior to being conjugated to PorB. 

28. The vaccine of claim 24 wherein said vaccine comprises about 
2 |ig of the GAMP, GCMP and GBMP polysaccharide antigen conjugates. 

29. The vaccine of claim 24, wherein said vaccine comprises about 
5 jag of the GAMP, GCMP and GBMP polysaccharide antigen conjugates. 

30. The vaccine of claim 24, wherein said vaccine comprises about 
2 jig of the GAMP and GCMP polysaccharide antigen conjugates, and about 5 ng 
of the GBMP polysaccharide antigen conjugate. 

31. A method of inducing an immune response in a mammal, 
comprising administering a vaccine comprising group A meningococcal 
polysaccharide (GAMP), group B meningococcal polysaccharide (GBMP). and 
group C meningococcal polysaccharide (GCMP) antigens, together with a 
pharmaceutically acceptable carrier, in an amount sufficient to induce an immune 
response in a mammal. 
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32. The method of claim 31, wherein said group A meningococcal 
polysaccharide (GAMP), group B meningococcal polysaccharide (GBMP), and 
group C meningococcal polysaccharide (GCMP) antigens are each conjugated to 
a protein carrier. 



5 



33. 



The method of claim 31, wherein said mammal is a human. 
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AT66ACGTTACGTTGTAC66TACAATTAAA6CA6GCGTAGAA6TTTCTCGCGTAAAAGAT6CT6GTACAT 70 
•TACCTGCAATGGAACATGCCATGTTAATTTGGTCCGCATCTTCAAAGAGCGCATTTTCTACGACCATGTA 

MDVTL YGT IKAGVEVSR VKOAG.T 

ATAAAGCTCAAGGCGGAAAATCTAAAACTGCAACCCAAATTGCCGACTTCGGTTCTAAAATCGGTTTCAA llir, 
TATTTCGAGTTCCGCCTTTTAGATTTTGACGTTGGGTTTAACGGCTGAAGCC AAGATTTTAGCCAAAGTT 

YKAQG GKSKTATQ I A D F G S K I G F K 

AGGTCAAGAAGACCTCGGCAACGGCATGAAAGCCATTTGGCA6TTGGAAC AAAAAGCCTCCATCGCCGGC 2 1 0 
TCCAGTTCTTCTGG AGCCGTTGCCGTACTTTCGGTAAACCGTCAACCTTGTTTTTCGGAGGTAGCGGCCG 
G Q E DLGNGMKA I WQLEQ K AS I AG 

ACTAACAGCGGCTGGG6TAACCGCCAGTCCTTCATCGGCTTGAAAGGCGGCTTCGGTACCGTCCGCGCGG 280 
TGATTGTCGCCGACCCCATTGGCGGTCAGGAA6TAGCCGAACTTTCC6CC6AAGCCATGGCAGGCGCG6C 

TNSGVGNRQSFI GL KGGFGTVRA 

GTAATCTGAACACCGTATTGAAAGACAGCGGCGACAACGTCAATGCATGGGAATCTGGTTCTAACACCGA 350 
CATTAGACTTGTGGCATAACTTTCTGTCGCCGCTGTTGCAGTTACGTACCCTTAGACCAAGATTGTGGCT 

GNLNTVLKDSGDNVNAWE SGSNTE 

AGATGTACTGGGACTGGGTACTATCGGTCGTGTAGAAAGCCGTGAAATCTCCGTACGCTACGACTCTCCC 420 
TCTACATGACCCTGACCCATGATAGCCAGCACATCTTTCGGCACTTTAGAGGCATGCGATGCTGAGAGGG 

D V L G L G TI GRV ES R E I S V R Y D S P 

GTATTTGCAGGCTTCAGCGGCAGCGTACAATACGTTCCGCGCGATAATGCGAATGATGTGGATAAATACA 490 
CATAAACGTCCGAAGTCGCCGTCGCATGTTATGCAAGGCGCGCTATTACGCTTACTACACCTATTTATGT 

VF AGFSGSVQYVPRDNANDVDKY 

AACATACGAAGTCCAGCCGTGAGTCTTACCACGCCGGTCTGAAATACGAAAATGCCGGTTTCTTCGGTCA 560 
TTGTATGCTTCAGGTCGGCACTCAGAATGGTGCGGCCAGACTTTATGCTTTTACGGCCAAAGAAGCCAGT 

KHTK SSRESYHAGLK YENA GF FGQ 

ATACGCAGGTTCTTTTGCCAAATATGCTGATTTGAACACTGATGCAGAACGTGTTGCAGTAAATACTGCA 630 
TATGCGTCCAAGAAAACGGTTTATACGACTAAACTTGTGACTACGTCTTGCACAACGTCATTTATGACGT 

YAG SF AKY ADLNTDAERVAVNTA 

AATGCCCATCCTGTTAAGGATTACCAAGTACACCGCGTAGTTGCCGGTTACGATGCCAATGACCTGTACG 700 
TTACGGGTAGGACAATTCCTAATGGTTCATGTGGCGCATCAACGGCCAATGCTACGGTTACTGGACATGC 

NAHPVKDYQVHRVVAGYDANDL Y 

TTTCTGTTGCCGGTCAGTATGAAGCTGCTAAAAACAACGAGGTTGGTTCTACCAAGGGTAAAAAACACGA 770 
AAAGACAACGGCCAGTCATACTTCGACGATTTTTGTTGCTCCAACCAAGATGGTTCCCATTTTTTGTGCT 

VSVAGQYEAAKNNEVGSTKGKKHE 

GCAAACTCAAGTTGCCGCTACTGCCGCTTACCGTTTTGGCAACGTAACGCCTCGCGTTTCTTACGCCCAr 8^0 
CGTTTGAGTTCAACGGCGATGACGGCGAATGGCAAAACCGTTGCATTGCGGAGCGCAAAGAATGCGGGTG 

QTQVAATAAYRFGN'VTPRVSYAH 
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GFKAKVN6VK0ANYQYDQVI V6 A 

ACTACGACTTCTCCAAACGCACTTCC6CTCTGGTTTCT6CCG6TTG6TT6AAACAA66TAAAGGCGCG6G 980 
TGAT6CTGAAGAGGTTTGCGTGAAGGCGAGACCAAAGACGGCCAACCAACTTTGTTCCATTTCC6CGCCC 

DYDFSKRTSALVSAGWLKQGKGAG 
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Fig. The production levels of the expressed MB3 
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MBaypnvl-5/pET24A -> 1 -phase Translation 

DNA sequence 942 b.p. ATGgacgttacc ... cacaaattctaa linear 

1/1 31/11 

ATG gac gtt acc ccg tac ggc acc att aaa gca ggc gta gaa act tec cgc tct gta ttt 
net asp val thr leu tyr gly thr ile lys ala gly val glu thr ser arg ser val phe 
61 / 21 91 / 31 

cac cag aac ggc caa gtt act gaa gtt aca acc get acc ggc ate gtt gat ttg ggt teg 
his gin asn gly gin val thr glu val thr thr ala thr gly ile val asp leu gly ser 
121 / 41 151 / 51 

aaa ate ggc ttc aaa ggc caa gaa gac etc ggt aac ggc ctg aaa gec att tgg cag gtt 
lys ile gly phe lys gly gin glu asp leu gly asn gly leu lys ala ile trp gin val 
181 / 61 211 / 71 

gag caa aaa gca tct ate gec ggt act gac tec ggt tgg ggc aac cgc caa tec ttc ate 
glu gin lys ala ser ile ala gly thr asp ser gly trp gly asn arg gin ser phe ile 
241 / 81 271 / 91 

ggc ttg aaa ggc ggc ttc ggt aaa . ttg cgc gtc ggt cgt ttg aac age gtc ctg aaa gac 
gly leu lys gly gly phe gly lys leu arg val gly arg leu asn ser val leu lys asp 
301 / 101 331 / 111 

acc ggc gac ate aat cct tgg gat age aaa age gac tat ttg ggt gta aac aaa att gec 
thr gly asp ile asn pro trp asp ser lys ser asp tyr leu gly val asn lys ile ala 
361 / 121 391 / 131 

gaa ccc gag gca cgc etc att tec gta cgc tac gat tct ccc gaa ttt gec ggc etc age 
glu pro glu ala arg leu ile ser val arg tyr asp ser pro glu phe ala gly leu ser 
421 / 141 451 / 151 

ggc age gta caa tac gcg ctt aac gac aat gca ggc aga cat aac age gaa tct tac cac 
gly ser val gin tyr ala leu asn asp asn ala gly arg his asn ser glu ser tyr his 
481 / 161 511 / 171 

gec ggc ttc aac tac aaa aac ggt ggc ttc ttc gtg caa tat ggc ggt gee tat aaa aga 
ala gly phe asn tyr lys asn gly gly phe phe val gin tyr gly gly ala tyr lys arg 
541 / 181 571 / 191 

cat cat caa gtg caa gag ggc ttg aat att gag aaa tac cag att cac cgt ttg gtc age 
his his gin val gin glu gly leu asn ile glu lys tyr gin ile his arg leu val ser 
601 / 201 631 / 211 

ggt tac gac aat gat gee ctg tac get tec gta gec gta cag caa caa gac gcg aaa ctg 
gly tyr asp asn asp ala leu tyr ala ser val ala val gin gin gin asp ala lys leu 
661 / 221 691 / 231 

act gat get tec aat teg cac aac tct caa acc gaa gtt gec get acc ttg gca tac cgc 
thr asp ala ser asn ser his asn ser gin thr glu val ala ala thr leu ala tyr arg 
721 / 241 751 / 251 

ttc ggc aac gta acg ccc cga gtt tct tac gec cac ggc ttc aaa ggt ttg gtt gat gat 
phe gly asn val thr pro arg val ser tyr ala his gly phe lys gly leu val asp asp 
781 / 261 811 / 271 

gca gac ata ggc aac gaa tac gac caa gtg gtt gtc ggt gcg gaa tac gac ttc tec aaa 
ala asp ile gly asn glu tyr asp gin val val val gly ala glu tyr asp phe ser lys 
841 / 281 871 / 291 

cgc act tct gec ttg gtt tct gee ggt tgg ttg caa gaa ggc aaa ggc gaa aac aaa ttc 
arg thr ser ala leu val ser ala gly trp leu gin glu gly lys gly glu asn lys phe 
901 / 301 931 / 311 

gta gcg act gee ggc ggt gtc ggt ctg cgc cac aaa ttc taa 
val ala thr ala gly gly val gly leu arg his lys phe ©eH- 



WO 97/28273 PCT/US97/01687 



"fa 



fff DNA Strider 1.0 §M Wednesday, January 17, 1996 8=17x35 PM 



Hcn.Cl«ss3 opt. -> 1-phase Translation 



DNA sequence 942 b.p. ATGgacgtCacT ... cacaaattctaa linear 
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xhcUZ^b^w— > List { ^ 322$ 

CA sequence 91S6 b.p. AGATCgcggccg ... CCCCCCAGGCAG 



I 10 I 20 I 30 

1 AGATCgeggc cgcgatctaa catccaaaga 
6 1 CGACATCCAC ACCTCCATTC TCACACATAA 
121 GCAOCAGACC GTTGCAAACG CACGAC CTCC 
181 CATOGAAAAA CCAOCCCAGT TATTGGGCTT 
241 ATTAOGCTAC TAACACCATG ACTTTATTAG 
301 ATOTTTOTTT ATTTCOGAAT GCAACAAOCT 
361 AOOOCTTTCT CACTCT G OOG TCAAATACTT 
421 TTTAAACOCT GTCTTGGAAC CTAATATGAC 
481 CTTTQCTTCC TTGAAATGCT AACGCCCACT 
S41 TACOGTTTGT CTTCTTTC CT ATTGATTCAC 
601 AGCGCACTCT CTCTATCCCT TCTGAACCCG 
661 AACAACCCGC TTTTTOGATG ATTATGCATT 
721 GGTGGGAATA CTGCTGATAG CCTAACGTTC 
781 CTTGACAGGC AATATATAAA CAGAAGGAAG 
841 TCATTATTAG CTTACTTTCA TAATTGCGAC 
901 CGACTTTTAA CCACAACTTG AGAAGATCAA 
961 CATGgacgcc actttgtacg gtactattaa 
1021 tcaccagaac grgccaagtta ctgaagttac 
1081 gaaaatcggc ttcaaaggcc aagaagacct 
1141 cgagcaaaaa gcatctaccg ccggtactga 
1201 cggcttgaaa ggcggcttcg gtaaattgcg 
1261 caccggcgac atcaatcctt gggatagcaa 
1321 cgaacccgag gcacgcctca tttccgtacg 
1381 cggcagcgta caatacgcgc ttaacgacaa 
1441 cgccggettc aactacaaaa acggtggctt 
1501 acatcatcaa gtgcaagagg gcttgaatat 
1S61 cggttacgac aatgatgccc tgtacgcttc 
1621 gactgacgct tccaattcgc acaactctca 
1681 efctcggcaac gtaacgcccc gagtttctta 
1741 tgcagacata ggcaacgaat acgaccaagt 
1801 acgcactccc gccttggttt ctgccggttg 
1861 cgtagcgact gccggcggtg ttggtctgcg 
1921 ACTCTTCCTC ACTTCAACTT GGGCATTACG 
1981 GACGATGTCA GAATGOCATT TGCC TGACAG 
2041 TTGTAACCTA TATAGTATAG CATTTTTTTT 
2101 TCCTGATCAG CCTATCTCGC AGCTGATGAA 
2161 TCCACTTTGA T GTTTTTCTT CGTATTTCCC 
2221 AGAAGTTCGT TTGTGCAAGC TTATCGATAA 
2281 TTCCTAACCC AGTCAGGCAC CGTCTATGAA 
2341 CACCGTCACC CTGGATGCTG TAGGCATAGG 
2401 GCGGGATATC GTCCATTCCG ACAGCATCGC 
2461 TGCGTTGATG CAATTTCTAT CCCCACCCGT 
2521 CCGCCCACTC CTCCTCCCTT CGCTACTTGG 
2581 CACACCCGTC CTGTGGATCT ATCGAATCTA 
2641 TAAGTCCC A G TTTCTCCATA CGAACCTTAA 
2701 CAGCACCCAG ATCCATCACT CCTTGCCCAA 
2761 GTGAAGTGAT GAACTTCTGG AAGGTTGCAG 
2821 CGTACCTTGG CAAACTGTCG TTGCTACCGG 
2881 AGGCACCAAC AAACACAGAT CCAGCGTGTT 
2941 CGATTTGCAG GATCAAGTGT TCAGGAGCCT 
3001 ACGTTGCAAC CGATAOGGTT CTAGAGTGTG 
3061 GCAACTGCAC ACCTTGCTTC TGAACAGCAT 
3121 TATCCACACC CAACAGAATC ACCTGGGAAT 
3181 CTCAGGCAAC GAAATCTGGA TCAGCGTATT 
32 41 CACCAGGCAT CTCAATACTA CACAGGGCTG 
3301 CAGTAACCAA CTCGTTTCCT GGACCAAATA 
3361 CCTAAGC C A T ACCACCTACT GCCTGGGCGC 
3 421 CCTTGTGGCC AACGTAGATG ACTTCTGGCC 
34 81 CAAAAACAAT TTCTTTGCAA CCAGC AACTT 
3 541 AACGCAGAAT TGCGCTTCCA CCAGGAATAT 
3601 AACGACACCA GACTACACCA GGGCAAGTCT 
3661 CATCCAATTT CCTCACCTTA TCTATAGACA 
3721 ATTGCATAAC TTCCTCTGCG AAAGGAGCTT 
37 81 CAAACTTCCC ACTTACTTCT AAAACCGCTT 



I 40 | 50 I 60 

OGAAAGGTTG AATCAAACCT TTTTGCCATC 60 
CTGCCAAACC CAACAGGAGG CGA TACACT A 120 
ACTCCTCTTC TCCTCAACAC CCACTTTTGC 180 
CATTGGAGCT CGCTCATTCC AATTCCTTCT 240 
OCTCTCTATC CTGGCCCCCC TOGCGAGCTC 300 
OCOCATTACA OCCGAACATC ACTOCACATG 360 
TCATGTTCCC AAATGCCCCA AAACTGACAG 420 
AAAAGCCTGA TCTCATOCAA GATGAACTAA 480 
TCGTCAAAAA GAAACTTCCA AAAGTCGCCA 540 
GAATCCTCAA AAATAATCTC ATTAATGCTT 600 
GTGGCACCTC TOCCGAAACC CAAAT CCOGA 660 
GTCCTCCACA TTGTATCCTT CCAACATTCT 720 
ATGATCAAAA TTTAACTGTT CTAACC CCTA 780 
CTCCCCTCTC TTAAACCTTT TTTTTTATCA 640 
TGGTTCCAAT TCACAAGCTT TTGATTTTAA 900 
AAAACAACTA ATTATTCGAA ACGAGGAATT 960 
ggctggtgtt gagacttccc gctctgtatt 1020 
aaccgctacc ggcatcgttg atttgggttc 1080 
cggtaacggc ctgaaagcca tttggcaggt 1140 
ctccggctgg ggcaaccgcc aatccctcat 1200 
cgtcggccgt ttgaacagcg tcctgaaaga 1260 
aagcgactat ttgggtgtaa acaaaattgc 1320 
ctacgattct cccgaatttg ccggcctcag 1380 
tgcaggcaga cataacagcg aatcttacca 1440 
cttcgtgcaa tacggcggtg cctataaaag 1500 
tgagaaatac cagattcacc gtttggtcag 1560 
cgtagccgta cagcaacaag acgcgaaact 1620 
aaccgaagtt gccgctacct tggcataccg 1680 
cgcccacggc ttcaaaggtt tggttgatga 1740 
ggttgtcggt gcggaatacg acttctccaa 1800 
gttgcaagaa ggcaaaggcg aaaacaaatt 1860 
ccacaaattc taaGAATTCC CTTAGACATG 1920 
AGAACACCCC TCTTCCTAGA T TCTAATCA A 1980 
ATCCAGCCTT CATTTTTGAT ACTTTTTTAT 2040 
GTCATTTTGT TTCTTCTCCT ACCAGCTTGC 2100 
TATCTTGTGG TAGGGGTTTG GGAAAATCAT 2160 
ACTCCTCTTC ACAGTACACA AGATTAAGTC 2220 
CCTTTAATGC GGTAGTTTAT CACAGTTAAA 2280 
ATCTAACAAT GCCCTCATCC TCATCCTCCG 2340 
CTTCGTTATG CCGGTACTGC CGGGCCTCTT 2400 
CAGTCACTAT GGCGTGCTGC T AGCGC TATA 2460 
TCTCCGAGCA CTCTCCGACC CCTTTCGCCG 2520 
ACCCACTATC CACTACCCGA TCATGGCCAC 2580 
AATGTAAGTT AAAATCTCTA AATAATTAAA 2640 
CAGCATTCCG GTCAGCATCT AGACCTTCAA 2700 
TATGTTTCAG TCCCTCACGA GTTACCTCTT 2760 
TCTTAACTCC CCTGTATTGA CCGGCATATC 2820 
AGGACTAATC TCCACAACTC TCTGGACACT 2880 
CTACTTCATC AACATAACAA CAACCATTCT 2940 
ACTCATTGGA CATTTCCAAA GCCTGCTCGT 3000 
CAATACACTT GCGTACAATT TCAACCCTTG 3060 
CTTCAATTCT GGCAACCTCC TTGTCTCTCA 3120 
CAATACCATG TTCACCTTCA CACAGAAGGT 3180 
TATCAGCAAT AACTAGAACT TCAGAAGGCC 3240 
ATGTGTCATT TTGAACCATC ATCTTGGCAC 3300 
TTTTGTCACA CTTACCAACA GTTTCTGTTC 3360 
CTCCTCCTAG CACCATACAC TTAGCACCAA 3420 
TAAGGGTACC ATCCTTCTTA GGTCGAGATG 3 4 B0 
TGGCAGGAAC ACCCACCATC AGGCAACTCG 3S40 
AG AGGCC A AC TTTCTCAATA CCTCTTGCAA 3600 
CAACTTGCAA CGTCTCCCTT ACTTGACCTT 3660 
GATCAATCCC TCTCTTAACC TTATCTGCCA 3720 
CTAACACACC TCTCTTCAAA CCGACTCCAT 3 7 BO 
TCTCACCATT TTCACCAACA TTCTCCACAA 3840 
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3841 TTCGTTTCAC TAATTCCATA ATCTOTTOCO 
3901 CAATTTCTTO TQA CCAQOCC 1TAGAAACOT 
3961 AAGGGACTTC TTTAQOTTTO CATICTTCTT 
4021 CATCTCCTTT CCTTCTAOTO A CCTTTA OOC 
4081 OCAAOCTCAC ACCCTACTTO OCACATCTAA 
4141 CCCAOGCTAT ATCTTCCTTG OATTTAOCTT 
4201 TAGCGTTCAA CAAAACTZCG TOOTCAAATA 
4261 TGCTCTTACC ATCCCACAAG GTOOCTTCCA 
4221 ACCAACTCCG TTCCAAGTGA CACAAAOCAA 
4381 ACTCTCCATC ACAATCCAAT TCCATACCCA 
4441 CTTTATACCA CAAACCCTCA COACOACATT 
4501 COOGAATACA CTTTTTOGAC GACTACACCA 
4561 CCAAAGTACT CAATAGACCA TCGGOOCOGT 
4621 TGACAGGGAA CTTTTTGACA TCTTGAGAAA 
4681 CAATAATGGG CATTATAOCA CAAGCAACAG 
4741 CAGAAAAACC ATAAACAGTT CTACTA00GC 
4801 GAGAACAAAA ACGCACAGCG ATACTAGCAT 
4861 GGGTCCTATA CATAACCCTA CCCCCTCCCA 
4921 CTACGTCCAA AATCACTTCA TTGATAOCAT 
4981 GCTCCTCAAA TTGGTOCTCT GTAACGGATG 
S041 TCCATTCACT CAACTTCATC AGCTTCTCCA 
5101 TTCCTACCAA ACTCAAGGAA TTATCAAACT 
SI 61 GAAATGTCAT ACTTCAAGTC GGACAGTGAG 
5221 TTTATTATCA GTGAGTCAGT CATCAGGAGA 
5281 TCACCCGCCC CACAGGTGCG GTTGCTCGCG 
5341 ATCGGGCTCG CCACTTCCGC CTCATGAGCG 
5401 CCGTGGCCGG GGGACTGTTG CGCGCCATCT 
5461 TCCTCAACGC CCTCAACCTA CTACTGGGCT 
5521 ACCCTCCACT ATCTATCATT CGAAGTATCG 
5561 TGAGCTCTCC TATCACATTA TCCCCAACTA 
S641 ATTTCTCTGA CTTTTGCTCA TCACTACACT 
5701 CAGAAATGTC CTTCTTCCAC ACACTAAATG 
S761 CACGAACAAA CTTCTTCTTT CCAACTTTTT 
S821 ATA TGTC CGG TAGGAATCGA GCGGGCAAAT 
SB81 AGCCTTTGTA CATACTGATC CCAACTTCAG 
5941 CCGAATCCAC AGAAATCAAA GTTGTTTGTC 
6001 AACT GACAAT AGTGTGCTCG TCTTTTGACC 
6061 ATCTAAATAA TCTTCACGAG CCAAGGCCAT 
6121 GTTAAAACGA CAAGTATCTC TCOCTCTATT 
6181 CATCAACTTG ACGGGCACTA TCTTGTTTTA 
6241 AAAGGTACGC TGATTTTAAA CCTGAAATTT 
6301 ATAACTCTTA TTTTTCAGTG TTCCCCATCT 
6361 CCTTATCCAT GATAACCTCT CAAACATGAC 
6421 AGAAATCTTG AAGACGAAAC GGCCTCGTGA 
64 81 TAA TAAT CCT TTCTTACACC TCAGGTGGCA 
6541 TTTCTTTATT TTTCTAAATA CATTCAAATA 
6601 AAATGCTTCA ATAATATTGA AAAAGGAAGA 
6661 TTATTCCCTT TTTTGCCGCA TTTTCCCTTC 
6721 AAGTAAAAGA TGCTGAAGAT CAGTTGGGTG 
6781 ACAGCGGTAA CATCCTTCAC AGTTTTCCCC 
6841 TTAAAGTTCT CCTATGTGGC GCGGTATTAT 
6901 GTCCCCGCAT ACACTATTCT CAGAATGACT 
6961 ATCTTACGCA TGGCATCACA CTAAGAGAAT 
7021 ACACTGCCGC CAACTTACTT CTGACAACGA 
7081 TGCACAACAT CGCGCATCAT CTAACTCCCC 
7141 CCATACCAAA CGACGACCGT CACACCACGA 
7201 AACTATTAAC TGCCGAACTA CTTACTCTAG 
7261 ACGCGCATAA AGTTGCACGA CCACTTCTCC 
7321 CTCATAAATC TGCACCCCGT GAGCGTOGCT 
7381 ATCGTAACCC CTCCCCTATC CTAGTTATCT 
74 41 AACCAAATAC ACAGATCGCT GAGATAGCTC 
7501 ACCAACTTTA CTCATATATA CTTTACATTG 
7S61 AAATTCGCGT TAAATTTTTG TTAAATCACC 
7621 AAAATCCCTT ATAAATCAAA AGAATAGACC 
7681 AACAACACTC CACTATTAAA CAACGTGGAC 
7741 CACCGCCATC CCCCACTACG TCAACCATCA 
7801 CGTAAACCAC TAAATCGGAA CCCTAAAGGG 
7861 CCCCCGAACC TGCCCACAAA GGAAGGGAAG 
7921 GCAACTCTAC CGCTCACCCT GCGCGTAACC 
7981 CACCCCCCCT AAAAGGATCT ACCTCAAGAT 
8041 TTAACCTCAC TTTTCGTTCC ACTCAGCCTC 
8101 TTGACATCCT TTTTTTCTCC CCCTAATCTC 



TTTTCIOOAT AOGACCAOGA ACCOCATCTT 39O0 
CAATTTTOCA CAATTCAATA OGACCTTCAG 3960 
TAOGTTCT7C CTTCGTQTAT CCTOOCTTO O 4020 
ACTTCATATC CAQGTTTCTC TOCACCTOCT 4080 
CTAATCCAAA ATAAAATAAG TCACCACATT 4140 
CTCCAACTTC ATCAGCTXCC TOOCTAATTT 4200 
AOCGTTTOGT ATAACAACCT TCTOOAGCAT 4260 
TCGCTCTAAG ACCCTTTGAT TGGCCAAAAC 4320 
CACCTCTTTC T7CAACCACA AATTTCAAGC 4380 
CCAACTTTTO AGTTCCTCCA GATCTACCAC 4440 
CGTACACTCC AGTTTCTCTC CTTATAGCCT 4500 
OOCCCAACCA GTAATTAGAA GAGTCACCCA 4560 
CAOTAGTCAA ACACCCCAAC AAAATTTCAC 4620 
GTTCCTATTC AGTAGTCAAT TCCCGAGCAT 4680 
TOGAACTCAC ATCTACCAAC TTTCCCGTCT 4740 
CATTACTCAA ACTTTTCAAA TCCCCCACTG 4800 
TACCGGGCAA GGATCCAACT TTATCAACCA 4860 
TCATCCTTTC CACAACTCTT TCTGCCAAAT 4920 
TATTCTACAA CTTCAGCAAG TTGTCGATCA 4980 
ACTCAACTTC CACATTAACT TCAAGCTCAG 5040 
CCTGGTCACC ACCATACCGA AACACGGCTT 5100 
CTGCAACACT TGCGTATCCA CGTAGCAAGG 5160 
TCTAGTCTTG AGAAATTCTG AACCCGTATT 5220 
TCCTCTACCC CCGACCCATC GTCCCCGCCA 5280 
CCTATATCCC CCACATCACC CATCCGGAAC 5340 
CTTGTTTCCG CCTGGCTATG GTGGCAGCCC 5400 
CCTTCCATGC ACCATTCCTT CCGGCCCCGC S460 
CCTTCCTAAT CCAGGACTCG CATAACCGAC 5520 
CAATGCTCAT ACCCCCATTC TTCACTCTCT 5580 
AAGCAACCCG ACGAGGACA7 TTCATCCTAA 5640 
CCAACTCTCA CACTATCTCC CTTATGACAC 5700 
AAGTCCCACC AATAAAGAAA TCCTTGTTAT 5760 
CG C T CC CT T G AACTATAAAA TCTAGAGTCC 5820 
OCTTACCTTC TGCACCTTCA AGAGGTATCT 5880 
TCACAACGTT CCTATTTCCT TCAAACCATT 5940 
TACTATTCAT CXrAACCCACT GCCGTCTTCA 6000 
TCATCTTTGT ATCAATAAAT CTACTCTTTG 6060 
AAATACCCAA ATCTAAAACT CTTTTAAAAC 6120 
AAACCCCAAA TCACCTCGTA CTCTGATCCT 6180 
GAGAAATTTG CCGACATGCG ATATCCAGAA 6240 
ATCTCAACAT CgcggccGCC ATCTCGAATA 6300 
CCGTCTATTT CACAATACCA ACATGACTCA 6360 
AATTAATTCG ATGATAAGCT CTCAAACATG 6420 
TACCCCTATT TTTATAGGTT AATCTCATCA 6480 
CTTTTCCCCC AAATGTGCGC CGAACCCCTA 6540 
TGTATCCCCT CATCACACAA TAACCCTCAT 6600 
CTATGAGTAT TCAACATTTC CCTCTCCCCC 6660 
CTGTTTTTCC TCACCCAGAA ACCCTCCTGA 6720 
CACCACTGGG TTACATCGAA CTCGATCTCA 6780 
CCGAACAACG TTTTCCAATC ATGACCACTT 6840 
CCCCTCTTCA CCCCGGGCAA CAGCAACTCC 6900 
TCCTTCAGTA CTCACCAGTC ACACAAAACC 6960 
TATCCACTGC TGCCATAACC ATGACTCATA 7020 
TCGCAGGACC GAAGGAGCTA ACCGCTTTTT 7080 
TTCATCCTTG GG AACCCGAC CTGAATGAAG 7140 
TCCCTGCACC AATGCCAACA ACGTTGCGCA 7200 
CTTCCCCCCA ACAATTAATA GACTCGATCG 7260 
GCTCGOCCCT TCCCCCTCGC TGGTTTATTG 7320 
CTCGCCGTAT CATTGCAGCA CTCCCGCCAC 7380 
ACACGACCGG GAGTCACCCA ACTATGCATC 74 40 
CCTCACTGAT TAAGCATTCG TAACTCTCAC 7500 
ATTTAAATTG TAAACGTTAA TATTTTCTTA 7560 
TCATTTTTTA ACCAATAGGC CGAAATCGGC 7620 
GAGATACCGT TGACTGTTGT TCCACTTTGC 7 680 
TCCAACCTCA AACCGCGAAA AACCCTCTAT 774 0 
CCCTAATCAA CTTTTTTGCG CTCCAGCTGC 7800 
ACCCCCCCAT TTAGAGCTTC ACGCGGAAAG 7860 
AAAGCGAAAC GACCCGCCGC TAGGCCGCTG 7920 
ACCACACCCC CCCCCCTTAA TCCCCCCCTA 7 980 
CCTTTTTGAT AATCTCATCA CCAAAATCCC 804 0 
ACACCCCCTA CAAAAGATCA AAGCATCTTC 8100 
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CTCCTTCCAA ACAAAAAAAC CACCCCTACC 8160 
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8221 
•281 
8241 
•401 
8461 
8S21 
8S81 
8641 
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8761 
8821 
8881 
8941 
9001 
9061 
9121 



JUX30 CT C O TT 
CAOCAOAOCO 
CAAOAACTCT 



TCTTTGCCCO 
CAOATACGAA 
GTACCACCCC 
CATAAGTCGT 



CTACACOGAA 
OAGAAAGCCC 
OCTTOCAGGC 
TGAGOCTCGA 
CGCGGCCTTT 
GTTATCCCCT 
CCOCAGCCGA 
O0GX3TATTTT 
TACAATCTGC 
TGCGTCATGG 



CTGAGATACC 
GACAGOTATC 
GCAAACCCCT 
TTTTTGTGAT 
TTACCGTTCC 
CAT T CTGTQC 
ACGACCGACC 
C TC CT T ACGC 
TCTCATCCCG 



AGGTTTTCAC 
I 10 



CATCCQCTTA 
CGTCATCACC 
I 20 



A TCAAOAQCT 
ATACTCTOCT 
CTACATACCT 
GTCTTAOO0O 
C QQ O G QCTT C 
TACACCGTCA 
OOOTAAQOOC 
GOTATCTTTA 
CCTCCTCACG 
TGGCCTTTTG 
ATAAOCGTAT 
OCAGCCACTC 
ATCTCTGCCC 
CATACTTAAC 
ACACCCOOCA 
CAGACAACCT 
GAAACGOGCG 
( 30 



AOCAACTCTT 
TCTAOTCTAG 
OGCTCTOCTA 
dTCCACTCA 
CTOCACACAG 
OCATTGAGAA 
CAOGCTCGCA 
TACTOCTGTC 
OOOGCOGAGC 
CTOGOCTTTT 
TACCGCCTTT 
ACTGAGCGAC 
TATTTCACAC 
OCACTATACA 
ACAOCO0CTX3 
CTGACCCTCT 
AGCCAC 

I 40 



TTTCO0AACC 
OC CTACTTA O 
ATCCTCTTAC 
AGACCATAGT 
CCCACCTTCO 
ACCGCCACOC 
ACACGAGAGC 
GOCTTTO3CC 
CTATCGAAAA 
OCTCACATCT 
GACTGAGCTC 
CAACCGGAAC 
CCCATATGGT 
CTCCCCTATC 
ACCCOCCCTG 
CCGGGAGCTC 

I 50 



TAACTCCCTT 8220 
OCCACCACTT 8280 
CACTGCCTCC 8340 
TACOOOATAA 8400 
AGCGAACGAC 8460 
TTCCOGAACO 8520 
CCACGACCCA 8580 
ACCTCT CA CT 8640 
ACGCCAGCAA 8700 
TCTTTCCT g C 8760 
ATACCCCTCG 8620 
AGCGCCTCAT 8880 
GCACTCTCAG 8940 
GCTACCTCAC 9000 
AOCG GC TT GT 9060 
CATGTGTCAG 9120 
9156 

I 60 
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1 AGATCTAACA TCCAAAGAOG AAAGGTTCAA 
61 CTCCATTCTC ACACATAACT COCAAACGCA 
121 TOCAAACGCA GGACCTCCAC TCCTCTTCTC 
181 ACCCCACTTA TTGGGCTTGA TTGGAGCTCG 
241 ACACCATGAC TTTATTAQCC TCTCTATCCT 
301 TTCCGAATCC AACAAOCTCC CCATTACACC 
361 CTO TOCCCTC AAATACTTTC ATC TTOOCAA 
421 CTTGGAACCT AATATGACAA AAGCCTGATC 
481 G AAATG CTAA CGOCCAGTTC OICAAAAAGA 
541 TCTTTGGTAT TGATTGACGA ATCCTCAAAA 
601 CTATCCCTTC TGAACCOOCT OOCACCTCTG 
€61 TTTGGATGAT TATGCATTCT CCTCCACATT 
721 CCTCATAGCC TAACGTTCAT CATCAAAATT 
781 TATATAAACA GAAGGAACCT GCCCTCTCTT 
641 TACTTTCATA ATTGCGACTC GTTCCAATTG 
901 ACAACTTCAG AAGATCAAAA AACAACTAAT 
961 CCTTGGAAAT TATTTTAGCT TTGCCTACTT 
1021 tgtacggtac tatcaagoct ggtgttgaga 
1081 aagttactga agtcacaacc gctaccggca 
1141 aagrgccaaga agacctcggt aacggccCga 
1201 ctatcgccgg tactgactcc ggttggggca 
1261 gcttcggtaa attgcgcgtc ggtcgtttga 
1321 atccttggga tagcaaaagc gactatttgg 
1381 gcctcatttc cgtacgctac gattctcccg 
1441 acacgctcaa cgacaatgca ggcagacata 
1501 acaaaaacgg tggcttcttc gtgcaacatg 
1561 aagagggcct gaatattgag aaataccaga 
1621 atgccctgta cgcttccgta gccgtacagc 
1681 attcgcacaa ctctcaaacc gaagttgccg 
1741 cgccccgagc ttcttaegcc cacggcttca 
1801 acgaatacga ccaagtggtt gtcggtgcgg 
1861 tggtttctgc cggtcggttg caagaaggca 
1921 gcgg tgt cgg tctgcgCcac aaattccaaT 
19B1 TCAA CTTG GC CATTACGAGA AGACCGCTCT 
2041 TGCCATTTGC CTCAGAGATG CAGGCTTCAT 
2101 AGTATAGG A T TTTTTTTGTC ATTTTGTTTC 
2161 ATCTCCC ACC TGATGAATAT CTTGTGGTAG 
2221 TTTTCTTCGT ATTTCCCACT CCTCTTCAGA 
2281 TGCAAGCTTA TCGATAACCT TTAATCCGCT 
2341 CAGGCACCGT GTATGAAATC TAACAATCCC 
2401 CATCCTGTAC CCATAGGCTT GGTTATGCCC 
2461 CATTCCGACA GCATCGCCAG TCACTATGGC 
2521 TTTCTATGCG CACCCGTTCT CGGAGCACTC 
2581 CTCGCTTCGC TACTTOGAGC CACTATCGAC 
2641 TGGATCTATC CAATCTAAAT GTAAGTTAAA 
2701 CTCCATACCA ACCTTAACAG CATTCCOCTG 
2761 CATCACTGCT TGCCCAATAT GTTTCAGTCC 
2821 CTTCTGGAAG GTTGCAGTGT TAACTCCCCT 
2881 AGTGTGCTTC GTACCOGAGG AGTAATCTCC 
2941 CACAGATCCA GCGTGTTGTA CTTGATCAAC 
3001 CAAGTGTTCA GGAGCGTACT GATTGGACAT 
3061 TAGGGTTGTA GAGTGTGCAA TACACTTGCG 
3121 TTCGTTGTGA ACAGCATCTT CAATTCTGCC 
3181 CAGAATCACC TGGGAATCAA TACCATCTTC 
3241 ATCTGGATC A GCCTATTTAT CAGCAATAAC 
3301 AATA CTACA C AGCCCTGATG TGTCATTTTG 
3361 CTTTCCTGGA CCAAATATTT TGTCACACTT 
3421 AGCTACTGCC TGGGCGCCTC CTCCTAGCAC 
3481 GTAGATGACT TCTCGCCTAA GGCTACCATC 
3 541 TTTGCAACCA GCAACTTTCG CAGGAACACC 
3601 CCTTCCACCA GGAATATAGA CGCCAACTTT 
J 661 TACACCACCC CAACTCTCAA CTTCCAACCT 
3721 CACGTTATCT ATACAGACAT CAATCCCTCT 
37 81 CTCTCGGAAA GCACCTTCTA ACACACCTCT 



(40 (50 I 60 

TCAAACCTTT TTCCCATCCG ACATCCACAO 60 
ACAGGAGGGG ATACACTACC AGCAGACCGT 120 
CTCAACACCC ACTTTTGCCA TOGAAAAACC 180 
CTCATTCCAA TTCCTTCTAT TAGGCTACTA 240 
OCCCCCCCTG GCGAGCTCAT CTTTOTTTAT 300 
CGAACATCAC TCCAGATGAG OGCTTTCTGA 360 
ATGGOOCAAA ACTGACACTT TAAACOCTCT 420 
TCATCCAACA TGAACTAAGT TTGCTTCCTT 480 
AACTTCCAAA AGTCGCCATA OW1W11T 540 
ATAATCTCAT TAATGCTTAG CGCAGTCTCT 600 
CCGAAACGCA AATGGGGAAA CAACCCCCTT 660 
GTATGCTTCC AACATTCTCG TGGGAATACT 720 
TAACTCTTCT AACCCCTACT TGACAGGCAA 780 
AAACCTTTTT TTTTATCATC ATTATTAGCT 84 0 
ACAAGCTTTT CATTTTAACG ACTTTTAACG 900 
TATTCGAAAC CATCTTCTCT CCAATTTTCT 960 
TCCAATCTGT CTTCCCTCCA gacgtcactt 1020 
cttcccgctc tgtatttcac cagaacggcc 10B0 
tcgttgattt gggttcgaaa atcggcttca 1140 
aagccatttg gcaggtcgag caaaaagcat 1200 
accgccaatc cttcatcggc ttgaaaggcg 1260 
acagcgtcct gaaagacacc ggcgacatca 1320 
gtgtaaacaa aattgccgaa cccgaggcac 1380 
aatttgccgg ccccagcggc agcgtacaat 1440 
acagcgaatc ttaccacgcc ggettcaact 1500 
Gcggtgccta caaaagacat catcaagtgc 1560 
ttcaccgctt ggtcagcggt tacgacaatg 1620 
aacaagacgc gaaactgact gatgcttcca 1680 
ccacctcggc ataccgcttc ggcaacgtaa 1740 
aaggtttggt tgatgatgca gacataggca 1800 
aacacgactt ccccaaacgc acttctgcct 1860 
aaggcgaaaa caaattcgta gcgactgccg 1920 
CTGGATCCTT AGACATGACT CTTCCTCAGT 1980 
TCCTAGATTC TAATCAAGAC CATCTCACAA 2040 
TTTTGATACT TTTTTATTTG TAACCTATAT 2100 
TTCTCCTACC AGCTTGCTCC TGATCAGCCT 2160 
GGCTTTCCGA AAATCATTCG AGTTTGATCT. 2220 
CTACAGAAGA TTAAGTCAGA AGTTCGTTTG 2280 
ACTTTATCAC ACTTAAATTC CTAACCCACT 2340 
CTCATCCTCA TCCTCCGCAC CGTCACCCTC 2400 
CTACTCCCGG GCCTCTTCCG GGATATCGTC 2460 
GTCCTGCTAC CGCTATATCC CTTGATGCAA 2520 
TCCGACCCCT TTGGCCGCCC CCCAGTCCTG 2580 
TACGCCATCA TGGCGACCAC ACCCGTCCTG 2640 
ATCTCTAAAT AATTAAATAA GTCCCAGTTT 2700 
AGCATCTAGA CCTTCAACAG CACCCAGATC 2760 
CTCACGACTT ACCTCTTCTC AAGTGATGAA 2820 
CTATTGACCG GCATATCCGT ACGTTGGCAA 2 8 B0 
ACAACTCTCT GGAGACTACG CACCAACAAA 2940 
ATAAGAAGAA GCATTCTCGA TTTGCAGGAT 3000 
TTCCAAAGCC TGCTCGTAGG TTGCAACCCA 3060 
TACAATTTCA ACCCTTGCCA ACTGCACAGC 3120 
AAGCTCCTTG TCTCTCATAT CGACAGCCAA 3180 
AGCTTGAGAC ACAAGCTCTC AGGCAACGAA 3240 
TACAACTTCA CAACCCCCAC CACCCATCTC 3300 
AACCATCATC TTCGGAGCAG TAACGAACTG 3360 
ACGAACACTT TCTCTTCCCT AAGCCATAGC 3420 
GATACACTTA GCACCAACCT TCTGGGCAAC 34 B0 
CTTCTTACGT CGAGATGCAA AAACAATTTC 354 0 
CACCATCACC CAACTCGAAC GCACAATTCC 3 600 
CTCAATACGT CTTCCAAAAC CACACCAGAC 3 660 
CTCCCTTACT TG ACCTTC A T GGAATTTCCT 37 20 
CTTAACCTTA TCTCGCAATT CCATAAGTTC 3780 



CTTCAAAC-C ACTCCATCAA ACTTCGCACT 38 40 
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3841 TAGTTCTAAA ACOGCTTTCT CACCAtTVTC 
3901 V1CCA TAATC TOTTOOOT TT TCTOCATAGG 
3961 OQAG O C C TT A CAAAOGTC AA TTTTOCACAA 
4021 AOGTTTGCAT TCTTCTTTAC CTTGTTOCTT 
4081 TCTAGTGACC TTTAGOGACT TCATATOCAC 
4141 CTACTTGGCA CATCTAACTA ATOCAAAATA 
4201 TICCT T O OAT TTA GC T TC T O CAACTTCATC 
4261 AACTTCCTCC TCAAATAACC CTTTOCTATA 
4321 CCACAAGGTC CCTTCCATCG CTCTAAGACC 
4381 CAACTGACAC AAACCAACAC CTCTTTCTTC 
4441 ATCCAATTCG ATACCCAGCA ACTTTTGAGT 
4501 ACCCTGACCA CGAGATTGGT AGACTCCACT 
4561 TTTGGACCAG TACACCAGGC OCAACGAGTA 
4621 TAGACCATCG GGGCGCTCAG TACTCAAAGA 
4681 TTTGACATCT TCAGAAAGTT OCTATTCACT 
4741 TATACCACAA GCAACAGTGG AACTCACATC 
4801 AACAGTTCTA CTACCOCCAT TACTGAAACT 
4861 CACAGCGATA CTAGCATTAG OGGGCAAOGA 
4921 AACCCTAGCC CCTGGGATCA TCCTTT0GAC 
4981 CACTTCATTG ATACCATTAT TCTACAACTT 
SO 41 CTCCTCTGTA ACGGATGACT CAACTTGCAC 
S101 CTTCATCAGG TTCTCCAGCT OGTCAGCACC 
5161 CAACGAATTA TCAAACTCTG CAACACTTCC 
S221 TCAACTCCCA CACTGACTGT AGTCTTGAGA 
5281 ACTCACTCAT CAGGACATCC TCTACGCCGG 
5341 AGCTGCGGTT GCTGGCCCCT ATATCGCCGA 
S401 CTTCGGGCTC ATCAGCGCTT GTTTCGGCCT 
5461 ACTCTTCCCC CCCATCTCCT TGCATGCACC 
5S21 CAACCTACTA CTCGGCTGCT TCCTAATCCA 
5581 TATCATTG G A AGTATCGGAA TCGTGATACC 
5641 CAGATTATCC CCAACTAAAC CAACCGGAGG 
5701 TTCGTCATCA GTACACTCCA ACTGTGACAC 
5761 CTTGCACACA CTAAATGAAG TCCCACCAAT 
5821 CTTGTTTCCA ACTTTTTCCG TGCCTTGAAC 
5881 GAATGGACCC GGCAAATCCT TACCT T CT GG 
5941 ACTCATGCCA ACTTCAGTGA CAACCTTGCT 
6001 AATCAAAGTT GTTTGTCTAC TATTGATCCA 
6061 CT CC TCGTCT TTTCAGCTCA TCTT T C TA TC 
6121 TGACCACCCA AGGCGATAAA TACCCAAATC 
6181 CTATCTCTCC CTGTATTAAA CCCCAAATCA 
6241 GGCACTATCT TCTTTTACAG AAATTTOCCG 
6301 TTTTAAACGT GAAATTTATC TCAAGATCTC 
6361 AACCTCTGAC ACATGCAGCT CCCGGAGACG 
6421 ACCAGACAAG CCCGTCAGGG CGCGTCAGCC 
64 81 ACCCAGTCAC GTACCGATAC OGGAGTGTAT 
6541 TTGTACTGAG ACTGCACCAT ATGCGGTCTG 
6601 ACCCCATCAG GCGCTCTTCC OCTTCCTCGC 
6661 TCCCCCGACC GCTATCACCT CACTCAAAGG 
6721 ATAACCCAGG AAACAACATC TGACCAAAAG 
€781 CCGCGTTCCT CCC GTTTTT C CATAGGCTCC 
6841 CCTCAAGTCA GACGTGCCGA AACCCGACAG 
6901 GAAGCTCCCT CGTGCGCTCT CCTCTTCCCA 
6961 TTCTCCCTTC CGGAAOCGTG OCCCTTTCTC 
7021 TGTAGGTCCT TCGCTCCAAG CTCGGCTCTC 
7081 CCGCCTTATC CGCTAACTAT CCTCTTGACT 
7141 TGGCAGCAGC CACTCGTAAC AGCATTACCA 
7201 TCTTGAAGTG CTGCCCTAAC TACGGCTACA 
7261 TGCTGAAGCC AGTTACCTTC GGAAAAAGAG 
7321 CCGCTGGTAG CGGTCGTTTT TTTCTTTGCA 
7381 CTCAAGAAGA TCCTTTCATC TTTTCTACCG 
7 441 CTTAAGGGAT TTTCCTCATG AGATTATCAA 
7501 CCCTCTAGCG GCGCATTAAG CGCGGCGGGT 
7561 CTTCCCAGCG CCCTAGCCCC CCCTCCTTTC 
7621 CCCGG CTTT C CCCGTCAACC TCTAAATCGC 
7681 TTACCGCACC TCCACCCCAA AAAACTTCAT 
77 41 CCCTGATAGA CCGTTTTTCG CCCTTTGACC 
7801 TTGTTCC A A A CTGGAACAAC ACTCAACCCT 
7861 ATTTTGCCGA TTTCGGCCTA TTGGTTAAAA 
7921 AATTTTA AC A AAATATTAAC CTTTACAATT 
7981 TTCGTCTCAC ACTTACCAAT GCTTAATCAC 
8041 TCGTTCATCC ATACTTCCCT CACTCCCCCT 
8101 ACCATCTCCC CCCACTCCTC CAATCATACC 
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AOGAACATTG TCCACAATTC CTTTCACTAA 3900 
AOOAOGAAGG CCATCTTCAA TTTCTTCTOA 3960 
TTCAATACGA CCTTCAGAAO GCACTTCTTT 4020 
OGTCTATCCT OOCTTGOCAT CXCCTTTOCT 4080 
CTTTCTCTCC ACCTCGTOCA ACGTCACACC 4140 
AAATAACTCA OCACATICCC AGOCTATATC 4200 
AGCTTCCTCC CTAATTTTAG CGTTCAACAA 4260 
AGAACCTTCT GGAGCATTCC TCTTACGATC 4320 
CTTTGATTGG CCAAAACAGG AAGTGCGTTC 4380 
AACCACAAAT TTCAAGCAGT CTCCATCACA 4440 
TCCTCCAGAT CTAGCACCTT TATACCACAA 4500 
TTOT0 TCC TT ATAGCCTCCG CAATACACTT 4560 
ATTAGAAGAC TCAGOCACCA AAGTAGTGAA 4620 
CGCCAACAAA ATTTCACTCA CAGGGAACTT 4680 
AGTCAATTCC CGAGCATCAA TAATGGOCAT 4740 
TACCAACTTT GCGCTCTCAG AAAAAOCATA 4800 
TTTCAAATCG CCCACTOGAG AAGAAAAAGG 4860 
TCCAACTTTA TCAACCAGCC TCCTATAGAT 4920 
AA CTC TT T CT OCCAAATCTA GGTCCAAAAT 4980 
GACCAAGTTC TCGATCAGCT CCTCAAATTG 5040 
ATTAACTTGA AGCTCACTCG ATTCACTCAA 5100 
ATAGGGAAAC ACCC CTTTTC CTACCAAACT 5160 
CTATGCAGGT AGCAAGCGAA ATGTCATACT 5220 
AATTCTGAAG CCGT ATTTTT ATTATCAGTG 5280 
ACGCATCCTC GCCGGCATCA CCGGCGCCAC 5340 
CATCACCCAT GGGG AAGATC GGGCTCCCCA S400 
GGCTATGGTG GCACGCCCCG TGGCCGGCGG 5460 
ATTCCTTGCG CCCGCGCTCC TCAACCCCCT 5520 
CGAGTCCCAT AAGGGAGACC CTCGACTATC 5580 
CGC AT T CTT C ACTCTCTTGA GGTCTCCTAT 5640 
ACCACATTTC ATGGTAAATT TCTCTCACTT 5700 
TATCTCGGTT ATCACAGCAG AAATCTCCTT 5760 
AAAGAAATCC TTCTTATCAG CAACAAACTT 5820 
TATAAAATCT AGAGTGGATA TGTCGGGTAC S880 
ACCTTCAAGA GGTATCTAGG CTTTCTAGAT 5940 
ATTTCCTTCA AACCATTCCG AATCCACAGA 6000 
AGCCACTCCG CTCTTGAAAC TCACAATACT 6060 
AATAAATCTA GTCTTTGA TC TAAATAATCT 6120 
TAAAACTCTT TTAAAACGTT AAAAGGACAA 6180 
GCTCCTACTC TGATCCTCAT CAACTTGAGG 6240 
AGATGCGATA TCGAGAAAAA CGTACGCTGA 6300 
TGCCTCGCCC CTTTCGCTGA TGACGGTGAA 6360 
CTCACAGCTT CTCTGTAAGC GGATGCCCCG 6420 
GGTGTTCGCG CCTCTCGGGG CCCAGCCATG 6480 
ACTGCCTTAA CTATGCGGCA TCAGACCAGA 6540 
AAATACCGCA CAGATGCGTA AGGAGAAAAT 6600 
TCACTCACTC GCTGCGCTCG GTCCTTCGGC 6660 
CGGTAATACC GTTATCCACA GAATCACGCG 6720 
CCCACCAAAA GGCCACGAAC CGTAAAAAGG 6780 
CCCCCCCTGA CGACCATCAC AAAAATCGAC 6840 
CACTATAAAC ATACCAGGCG TTTCCCCCTG 6900 
CCCTGCCCCT TACCGGATAC CTCTCCCCCT 6960 
AATGCTCACC CTGTAGGTAT CTCAGTTCCC 7020 
TGCACGAACC CCCCCTTCAG CCCGACCOCT 7080 
CCAACCCCCT AAGACACGAC TTATCGCCAC 7140 
GAGCGACCTA TCTAGGCCGT CCTACAGAGT 7200 
CTAGAACCAC AGTATTTCGT ATCTGCGCTC 7260 
TTGGTAGCTC TTGATCCGCC AAACAAACCA 7320 
AGCACCAGAT TACGCGCAGA AAAAAAGGAT 7380 
CCTCTCACCC TCAGTGGAAC GAAAACTCAC 7440 
AAAGGATCTT CACCTAGATC CTTTTACGCC 7500 
CTGGTCCTTA CGCGCAGCCT GACCCCTACA 7560 
C CTTTCTT CC CTTCCTTTCT CCCCACGTTC 7620 
GGGCTCCCTT TACCGTTCCC ATTTACTCCT 7680 
TACCCTCATG CTTCACGTAC TGGGCCATCG 7740 
TTCGAGTCCA CGTTCTTTAA TAGTGGACTC 7800 
ATCTCGCTCT ATTCTTTTCA TTTATAAGGG 7860 
AATGAGCTCA TTTAACAAAA ATTTAACGCG 7920 
TAAATCAATC TAAAGTATAT ATG ACT AAA C 7980 
TCACGCACCT ATCTCAGCCA TCTCTCTATT 8040 
CCTCTACATA ACTACCATAC. CCCAGCCCTT 8100 
GCCAGACCCA CGCTCACCCC CTCCACATTT 8160 



it, 1-713 
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6161 
•221 
•281 
•341 
•401 
•461 
•521 
•581 
•641 
6701 
•761 
8821 
8881 
8941 
9001 
9061 
9121 
9181 



ATCA0CAATA 
Q C X C TOCATC 
TAOTTTCCOC 
TATOGCTTCA 
GTOCAAAAAA 
AOTGTTATCA 
AAGATCCTTT 
OCCACCGACT 
TTTAAAACTC 
OCTCTTGAGA 
TACTTTCACC 
AATAAGGGCG 
CATTTATCAG 
ACAAATACGG 
TATTATCATG 
TTTCTCATCT 
TAAOCTOACT 
CTTATTATTC 
I 10 



AA CCAGC CAG 
CAGTCXATTA 
AACCTTOTTC 
TTCAOCTOOO 
GOGGTTAGCT 
CTCATGCTTA 
TCTGTGACTG 
TCCTCTTOCC 
CTCATCATTG 
TCCAGTPCCA 
AGCGTTTCTG 
ACACCCAAAT 
CGTTATTGTC 
GTTCCCCOCA 
ACATTAACCT 
TTGACAGCTT 
CATCTTGCTA 
C 

I 20 



O0COAAOC3CC 

Mvarrroaca 

OCATTOCTCC 
CTTCOCAAOC 
OCTTCGGTCC 
TGOCACCACT 
CTGACTACTC 
QCQCCTCAAC 
GAAAACGTTC 
TGTAACOCAC 
OGTGAOCAAA 
GTTQAATACT 
TCATCAOCGC 
CATTT00CCG 
ATAAAAATAG 
ATCATCGAAT 
TTGTGAAATA 

I 30 
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OOAGCCCACA 
QGAAOCTAGA 
AGGCATCGTG 
ATCAAOGCGA 
TCCGATOCTT 
GCATAATTCT 
AACCAACTCA 
ACGGGATAAT 
TTCOGCGCCA 
TCGTOCACCC 
AACAGGAAGG 
CATACTCTTC 
ATACATATTT 
AAAAGTGCCA 
OCCTATCACG 
TAATTCTCAT 
CACGCAGATC 



I 



40 



AOTOCTOCT C 
GTAAGTACTT 
GTGTCAOGCT 
CTTACAT0AT 
GTCA CAAGT A 
CTTACTX3TCA 
TTCTGACAAT 
ACCCOGOCAC 
AAACTCTCAA 
AACTGATCTT 
CAAAATCCCG 
CTTTTTCAAT 
CAATGTATTT 
CCTCA CGTC T 
AGGCCCTTTC 
GTTTGACACC 
GGGAACACTG 

I 50 



CAACTTTATC 
CGOCAGTTAA 
CGTCGTTTOG 
COCCCATGTT 
AGTrOGCOOC 
TCCCATCCGT 
AGTGTATOCG 
ATAGCAGAAC 
CGATCT TACC 
CACCATCTTT 
CAAAAAAGGG 
ATTATTGAAG 
AGAAAAATAA 
AAGAAACCAT 
CTCTTCAAGA 
TTATCATCGA 
AAAAATAACA 



8220 
8280 
8340 
8400 
8460 
8520 
8S80 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9191 
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BH* sequence 8974 b.p. ACATCTAACATC ACTTATTATTCG linear 

I 10 I 20 I 30 I 40 I SO I 60 

1 AGATCTAACA TCCAAAOACC AAACGTTCAA TGAAACCTTT TTGCCATCCG ACATCCACAG 60 
61 GTCCATTCTC ACACATAAGT CCCAAACGCA ACAGGAGCCO ATACACTAGC AGCAGACCGT 120 
121 TCCAAACCCA GGACCTCCAC T C CTCTTCT C CTCAACACCC A CTTTTOCC A T0GAAAAACC 180 
181 AGCCGACTTA TTOOGCTTOA TTOGACCTCG CTCATTCCAA TTCCTTCTAT TAGGCTACTA 240 
241 ACACCATGAC TTTATTACCC 1CTCTATCCT CCCCCOCCTG OCGACGtTCA TCTTTCTTTA 300 
301 TTTCCOAATG CAACAAOCTC O0CATTACAC CCCAACATCA CTCCAGATGA GGOCTTTCTG 360 
361 AGTGTGGCCT CAAATAGTTT CATOTTOCCc AAATCCCCCA AAACTGACAC TTTAAACGCT 420 
421 CTCTTGGAAC CTAATATGAC AAAAGCCTCA TCTCATCCAA GATGAACTAA CTTTSGTTCG 480 
481 TTOAAATGCT AACOGCCAGT TOCTCAAAAA GAAACTTOCA AAACTCGCCA TACCGTTTGT 540 
541 CTTCTTTGCT ATTGATTOAC CAATOCTCAA AAATAATCTC ATTAATCCTT ACCGCAGTCT 600 
601 CTCTA TCGCT TCTGAACCCG CTGGCAOCTC TOCCGAAACG CAAATCOGCA AACaCCCGCT 660 
661 TTTTOGATGA TTATGCATTG TcTCCACATT CTAT GC TTCC AAGATTCTGG TOGGAATACT 720 
721 CCTCATACCC TAACGTTCAT CATCAAAATT TAACTGTTCT AACCCCTACT TGACAGCAAT 780 
781 A TATA AACAG AAGCAAGCTC OCCTOTCTTA AAC CTTTTTT TTTATCATCA TTATTAGCTT 840 
841 ACTTTCATAA TTCCCACTGC TTCCAATTCA CAAGCTTTTG ATTTTAACCA CTTTTA ACGA 900 
901 CAACTTCACA AGATCAAAAA ACAACTAATT ATTCCAAGGA TCCAAACGat gAGATTTCCT 960 
961 TCAATTTTTA CTCCAG TTTT ATTCCCAGCA TCCTCCGCAT TACCTCCTCC AGTCAACACT 1020 
1021 ACAACAGAAG ATGAAACGGC ACAAATTCCG GCTGAAGCTG TCATCGCTTA CTCAGATTTA 1080 
1081 CAAGGGGATT TCGATCTTGC TCTTTTGCCA TTTTCCAACA GCACAAATAA CCGGTTATTG 1140 
1141 TTTAT AAA T A CTACTATTGC CAGCATTCCT GCTAAAGAAG AAGCCCTATC TCTCGAGAAA 1200 
1201 ACACACCCTG AACCTTACCT ACAATTCgac gtcactttgt. acggtactat taaggctggt 1260 
1261 gttgagactt cccgctctgt atttcaccag aacggccaag ttactgaagt tacaaccgct 1320 
1321 accggcatcg ttgatttggg ttcgaaaatc ggcctcaaag gccaagaaga cctcggtaac 1380 
1381 ggcctgaaag ccatttggca ggttgagcaa aaagcatcta tcgccggtac tgactccggt 1440 
1441 tggggcaacc gccaatcctt catcggcttg aaaggcggct ccggtaaatt gcgcgteggt 1500 
1501 cgtttgaaca gcgtcccgaa agacacoggc gacatcaatc cttgggatag caaaagcgac 1560 
1561 tatttgggcg taaacaaaat tgccgaaccc gagycacgcc ^tcatttccgt acgetacgat 1620 
1621 tctcccgaat ttgccggcct cagcggcagc gtacaatacg cgcttaacga caatgcaggc 1680 
1681 agacataaca gcgaatctta ccacgcoggc ttcaactaca aaaacggtgg ctccttcgtg 1740 
1741 caatatggcg gtgcctataa aagacatcat caagtgcaag agggcttgaa tattgagaaa 1800 
1801 taccagattc accgtttggt cagcggttac gacaatgatg ccctgtacgc ttccgtagcc 1860 
1861 gtacagcaac aagacgcgaa actgactgat gcttccaatt cgcacaactc ccaaaccgaa 1920 
1921 gttgccgcta ccttggcata ccgcttcggc aacgtaacgc cccgagtttc ttacgcccac 1980 
1961 ggcctcaaag gtttggttga tgatgcagac ataggcaacg aatacgacca agtggttgtc 2040 
2041 ggcgcggaat acgacttctc caaacgcact tctgccctgg tttctgccgg ttggctgcaa 2100 
2101 gaaggcaaag gcgaaaacaa attcgtagcg actgccggcg gtgttggtct gcgtcacaaa 2160 
2161 ttctaaCCTA GGCCCCCCGC GAATTAAtTC CCCTTAGACA TCACTCTTCC TCACTTCAAG 2220 
2221 TTGGCCAcTT ACGAGAAGAC CGGTCTTCCT AGATTCTAAT CAAGAGGATG TCAGAATGCC 2280 
2281 ATTTCCCTCA GAGATGCAGG CTTCATTTTT GATACTTTTT T ATTTST AAC CTATATACTA 2340 
2341 TAGGATTTTT TTTGTCATTT TGTTTCTTCT CGTACGACCT TGCTCCTGAT CAGCCTATCT 2400 
2401 CGCAGCTGAT GAATATCTTG TGGTAGGGCT TTGGGAAAAT CATTCGAGTT TGATGTTTTT 2460 
2461 CT TCa TATTT CCCACTCCTC TTCAGAGTAC AGAACATTAA CTOACAAGTT CCTTTGTGCA 2520 
2521 ACCTTATCCA TAAGCTTTAA TGCCGTACTT TATCACACTT AAATTCCTAA CGCACTCAGC 2580 
2581 CACC CTGTAT GAAATCTAAC AATGCGCTCA TCGTCATCCT CGCCACCCTC ACCCTGCATC 2640 
2641 CTGTACCCAT AGC CTTGGTT ATCCCGCTAC TCCCGCGCCT CTTCCGGCAT ATCGTCCATT 2700 
2701 CCGACAGCAT CGCCACTCAC TATGGCCTCC TOCTAOCGCT ATATGCCTTC ATGCAATTTC 2760 
2761 TATCCGCACC CCTTCTCGGA OCACTGTCCG ACCGCTTTGC CCCCCGCCCA CTCCTGCTCC 2820 
2821 CTTCOCTACT TGGAGCCACT ATCGACTACG CGATCATGGC GACCACACCC GTCCTCTGGA 2880 
2881 TCTATCCAAT CTAAATGTAA CTTAAAATCT CTAAATAATT AAATAAGTCC CA GTTTCTCC 2940 
2941 ATACGAACCT TAACAGCATT GCGGTGAGCA TCTAGACCTT CAACAGCAGC CAGATCCATC 3000 
3001 ACTCCTTCCC CAATATCTTT CACTCCCTCA GGAGTTACGT CTTCTCAAGT CATGAACTTC 3060 
3061 TCGAACCTTG CAGTGTTAAC TCCGCTCTAT TGACGGGCAT ATCCGTACGT TGGCAAACTC 3120 
3121 TCCTTGGTAC CCGACGAGTA ATCTCCACAA CTCTCTCGAG AGTAGGCACC AACAAACACA 3180 
3181 GATCCAGCGT CTTCTACTTG ATCAACATAA CAAGAACCAT TCTCGATTTC CAGGATCAAC 3240 
3241 TCTTCACCAG CGTACTCATT GCACATTTCC AAAGCCTGCT CCTAGGTTCC AACCCATACC 3300 
3301 CTTGTACACT CTGCAATACA CTTCCGTACA ATTTCAACCC TTGCCAACTC CACAGCTTCG 3360 
33 61 TTGTCAACAC CATCTTCAAT TCTGGCAAGC TCCTTCTCTG TCATATCGAC AGCCAACAGA 3 420 
3421 ATCACCTGGG AATCAATACC ATCTTCAGCT TGAGACACAA CCTCTGAGGC AACCAAATCT 3480 
3481 CCATCACCCT ATTTATCAGC AATAACTAG A ACTTCACAAC GCCCACCACC CATGTCAATA 3540 
3541 CTACACACGG CTCATGTGTC ATTTTCAACC ATCATCTTCG CACCACTAAC CAACTGGTTT 3600 
3 601 CCTGG ACC A A ATATTTTCTC ACACTTACCA ACACTTTCTG TTCCCTAACC CAT A GC AGCT 3660 
3661 ACTCCCTCCC CCCCTCCTCC TAGCACCATA CACTTACCAC C A A CCTTGTG GCCAACCTAC 3720 
3721 ATCACTTCTG GGGTAACCCT ACCATCCTTC TTACCTGGAG ATCCAAAAAC AATTTCTTTG 3780 
37 Bl CAACCACCAA CTTTCGCACG AACACCCACC ATCACGCAAC TGCAACCCAG AATTCCCCTT 384 0 
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3041 OC ACCAO QAA TATAGfUOOCC AACTITCTCA 
3901 OC A QQOCAAO TCTCAACTTO CAACCTCTCC 
3961 TTATCTATAG AGAOATCAAT OOCT CTCTTA 
4021 OCCA AAOCAO CTTCTAACAC AGSTOXCXTC 
40B1 TCT AAAAGGO CTTTCTCACC ATTTTGACGA 
4141 ATAATCTCTT CO OTTTTCTQ GATAGCACCA 
4201 OCC T TAGAAA CCTCAATTTT GCACAATTCA 
4261 TTGGATTCTT CTTTAGCTTC TICC TI WIX; 
4321 CTGACCTTTA CCCACTTCAT ATCCACGTTT 
4381 TTOCC ACATC TAACTAATGC AAAATAAAAT 
4441 TTGCATTTAG CTTCTOCAAG TTCATCAOCT 
4S01 TCCTCCTCAA ATAACCGTIT CCTATAAOAA 
4561 AAGGTGGCTT CCATOCCTCT AAGACCCTTT 
4621 TCACACAAAC CAACACCTCT TTCTTCAACC 
4681 AATTCCATAC CCAGCAACTT TTGACTTOCT 
4741 TOACCACCAG ATTGGTAGAC TCCAGTTTCT 
4801 GACGACTACA CCAGGCCCAA CGAGTAATTA 
4861 CC ATCGG GGC GGTCAGTACT CAAAGAGGCC 
4921 ACATCTTCAC AAACTTCGTA TTCAGTAGTC 
4981 OCAGAAGCAA CAGTGGAACT CACATCTACC 
5041 CTTCTACTAC CGCCATTAGT GAAACTTTTC 
S101 CCCATACTAC CATTAGCGGG CAAGGATCCA 
5161 CTAGCGCCTG CGATCATCCT TTGGACAACT 
5221 TCATTGATAC CATTATTCTA CAACTTGACC 
5281 TCTGTAACGG ATCACTCAAC TTCCACATTA 
5341 ATCAGCTTCT GCACCTCGTC AGCAGCATAG 
5401 CAATTATCAA ACTCTCCAAC ACTTGCCTAT 
5461 GTCCCACACT GACTGTAGTC TTGAGAAATT 
5521 AGTCATC A C G AGATCCTCTA CGCCGGACGC 
5581 CCCGCCACAC GTGCCGTTGC TGGCGCCTAT 
5641 CCTCGCCACT TCCCGCTCAT CACCCCTTGT 
S701 GCCCGGGCAC TCTTCOGCGC CATCTCCTTG 
5761 AACGOCCTCA ACCTACTACT OCGC TO CTT C 
5821 CGAGTATCTA TGATTGGAAG TATCGCAATC 
5881 TCTCC TATCA GATTATGCCC AACTAAACCA 
5941 TCTGACTTTT CCTCATCAGT ACACTCGAAC 
6001 ATCTCCTTCT TGGAGACAGT AAATGAAGTC 
6061 ACAAACTTCT TGTTTCCAAC TTTTTCOCTC 
6121 TCCCGTACGA ATCGAOCCCC CAAATCCTTA 
6181 TTCTACATAC TCATCCCAAC TTCACTGACA 
6241 TOCACAGAAA TCAAACTTCT TTGTCTACTA 
6301 ACAATACTCT CC PCCTOTTT TCACGTCATC 
6361 AATAATCTTG ACGACCCAAG GCGATAAATA 
6421 AAGGACAACT ATCTCTGCCT CTATTAAACC 
6481 AC TTCA CCCC CACTATCTTC TTTTAGAGAA 
6S41 TACCCTCATT TTAAACGTGA AATTTATCTC 
6601 ACGGTGAAAA CCTCTCACAC ATCCAGCTCC 
6661 ATCCCCGG A G CAGACAACCC CCTCACOGCG 
6721 CACCCATCAC CCACTCACGT AGCGATACCC 
6781 AGACCAGATT GTACTGAGAG TGCACCATAT 
6841 G ACAA AATAC CCCATCAGCC GCTCTTCCGC 
6901 CGTTCGCCTC CCGCGAGCCG TATCACCTCA 
6961 ATCACGGCAT AACGCAGGAA AGAACATGTG 
7021 TAAAAACGCC CCGTTGCTGC CCTTTTTCCA 
7081 ^ AATCCACCC TCAACTCAGA CCTCGCGAAA 
7141 TCCCCCTCCA ACCTCCCTCQ TGCCCTCTCC 
7201 CTCCOCCTTT CTCCCTTCGC GAAGCGTCCC 
7261 CAGTTCGGTG TAGGTCGTTC CCTCCAACCT 
7321 CGACCCCTGC CCCTTATCCG CTAACTATCC 

73 Bl ATCCCCACTC GCAGCAGCCA CTGCTAACAG 

74 41 TACAGAGTTC TTGAAGTGGT OGCCTAACTA 
7501 CTCCGCTCTG CTGAACCCAG TTACCTTCGC 
7561 ACAAACCACC CCTGGTAGCG CTC GTTTTTT 
7621 AAAAGGATCT CAAGAAGATC CTTTGATCTT 
7681 ^ AACTCACCT TAACC GATTT TGCTCATGAC 
7741 TTTaaAttaa aaatgaagtt ttaaatcaat 
7801 cagttaccaa tgcttaatca . gtgaggcacc 
7861 catagtcgcc tgactccccg tcgtgtagat 
7921 ccccagcgct gcaatgatac cgcgagaccc 
7981 aaaccagcca gccggaaggg ccgaycgcag 
8041 ccagtccatt aattgttgcc gggaagccag 
8101 caacgctgtt gccattgctg caggcatcgt 



ATA CCTCTT G CAAAAOGAGA CCACACTA CA 3900 
OTTACTTOAG CTTCATOGAA TTTOCTCACG 3960 
ACCTTATCTO GCAATTCCAT AAGTTCCTCT 4020 
AAAGCCACTC CATCAAACTT OGCAGTTAGT 4080 
ACATTCTCGA CAATTOGTTT GACTAATTOC 4140 
OOAAGGGCAT CTTCAATTTC TTCT OA CGAQ 4200 
ATAOOACC TT CAGAACOGAC TTCTTTAOC3T 4260 
T A TOCTOQCT TGCCATCTCC TTTCCTTCTA 4320 
CTCTCCACCT CCTCCAACCT CACACCCTAC 4360 
AAGTCAGCAC ATTCCCAOCC TAT ATCTTCC 4440 
TCCTCCCT AA TTTTAGCCTT CAACAAAACT 4500 
OCTTCTGGAC CATTGCTCTT ACGATCCCAC 4560 
CATTCCCCAA AACAGOAAGT GCCTTCCAAC 4620 
ACAAATTTCA AOCAGTCTCC ATCACAATOC 4680 
CCAGATCTAC CACCTTTATA CCACAAACCG 4740 
CTOCTT ATAO OCTCCCOAAT ACA CTTTTTO 4800 
CAACAGTCAC CCACCAAACT AGTCAATACA 4860 
AACAAAATTT CACTGACAGG CAA CTTTTTG 4920 
AATTGCCCAG CATCAATAAT CGCGATTATA 4980 
AACTTTCCCG TCTCACAAAA AGCATAAACA 5040 
AAATCGCCCA CTGCAGAAGA AAAAGCCACA 5100 
ACTTTATCAA CCAGCCTCCT ATAGATAACC 5160 
CTTTCTCCCA AATCTACCTC CAAAATCACT 5220 
AAC TTCTCC A TCAGCTCCTC AAATTCCTCC 5280 
ACTTGAACCT CACTCCATTG AGTCAACTTG 5340 
CCAAACACCG CTTTTCCTAC CAAACTCAAG 5400 
GCAGCTAGCA AGGGAAATCT CATACTTGAA 5460 
CTCAAOCCGT ATTTTTATTA TCACTGACTC 5520 
ATCGTCCCCg acetgcaggt cGCCATCACC 5580 
ATCCCCCACA TCACCGATCG CGAACATCCC 5640 
TTCCCCCTGG CTATGCTGCC ACCCCCCCTG 5700 
CATCCACCAT TCCTTOCGCC COCCCTCCTC 5760 
CTAATCCACC ACTCCCATAA CGGACAGCCT 5820 
GTGATACCCC CATTCTTCAG TCTCTTGAGG 5880 
ACCCGAGCAG CAGATTTCAT CCTAAATTTC 5940 
TGTGAGACTA TCTCGCTTAT CACACCACAA 6000 
CCACCAATAA AGAAATCCTT GTTATCAGGA 6060 
CCTTGAACTA TAAAATCTAG ACTCCATATC 6120 
CCTTCTCCAC CTTCAACACG TATCTACCCT 6180 
ACGTTOCTAT TTCCTTCAAA CCATTCCCAA 6240 
TTCATCCAAG CCAGTCCGCT CTTGAAACTG 6300 
TTTGTATCAA TAAATCTACT CTTTGATCTA 6360 
CCCAAATCTA AAACTCTTTT AAAACGTTAA 6420 
CCAAATCACC TCGTACTCTG ATCCTCATCA 648C 
ATTTCCGCAG ATGCGATATC CAGAAAAAGG 6540 
AAGATCTCTG CCTCCCGCCT TTCCGTGATG 6600 
CGGAGACGGT CACAGCTTCT CTCTAAGCCG 6660 
CCTCAOCCGG TCTTCCOGGC TCTCCGCCCC 6720 
GAGTCTATAC TGGCTTAACT ATCCGCCATC 6780 
CCGGTGTGAA ATACCGCACA CATGCGTAAG 6840 
TTCCTCCCTC ACTCACTCGC TGCGCTCCGT 6900 
CTCAAAGCCG CTAATACCCT TATCCACAGA 6960 
AGCAAAAGGC CACCAAAACC CCACGAACCG 7020 
•TAGGCTCCGC CCCCCTCACC AGCATCACAA 7080 
CCCCACAGCA CTATAAAGAT ACCACCCCTT 7140 
TGTTCCCACC CTGCCGCTTA CCGCATACCT 7200 
OCTTTCTCAA TGCTCACGCT GTAGCTATCT 7260 
GGGCTGTCTC CACGAACCCC COGTTCAGCC 7320 
TCTTGACTCC AACCCGGTAA GACACGACTT 7380 
CATTACCAGA GCGAGGTATG TAGGCCGTGC 7440 
CGCCTACACT AGAAGGACAG TATTTGGTAT 7500 
AAAAAGAGTT CCTACCTCTT CATCCCGCAA 7560 
TCTTTOCAAG CACCACATTA CGCCCAGAAA 7620 
TTCTACGGCG TCTGACGCTC AGTGGAACGA 7680 
ATTATCAAAA AGGATCTTCA CCTAGA TCCT 7740 
ctaaagtaca catgagtaaa ctcggtctga 7800 
tatcccagcg atctgtctat ttcgttcatc 7860 
aactacgata cgggagggct taccacctgg 7920 
acgctcaccg gctccagatt taccagcaat 7980 
aagcggcccc gcaacttcat ccgcctccat B040 
agtaagtagt tcgccagtta atagtctgcg 8100 
ggtgtcacgc ccgtcgttcg gcatggcttc 8160 
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•162»*tteagetcc ggttcecaac C«tcw WCfl 
•221 agcggtugc tccttoggtc cfcccaatopt 
•2B1 actcatgrgct *tggaiocAC tgeaUtttc 
•341 ttctgCgact ggtgagtact CMccMgtc 
•401 ttgctcttgc ccggcgtcoA cacyggaua 
•461 gctcatcatt ogaaaacgtt cttcaoaaca 
8S21 atccacrttcfl atgtaaccca ctqjttfcacc 
•561 cagcgtttct aagtgragcaa aaacaggaap 
•641 eacacggaaa tgttgaatac tcatactctt 
•701 aggttattgt ctcatgagcg gatacaeatt 
•761 ggttccgcffc acatttcccc gaaaagtgcc 
•821 oacattaacc tataaaaata ggcgtatcac 
•881 CATCTTTGAC ACCTTATCAT OGATAAGCTG 
•941 ATCGGGAACA CTCAAAAATA ACAGTTATTA 
I 10 I 20 I 30 
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agttaeatga tcccccatgt fcgtgcaaaaa 8220 
tgtcagaagt aagttggcog cagtgttaec 8280 
fcettactgtc atgocafceog taagatgett 8340 
attctgagaa fcagtgtatgc ogogacogag 8400 
caccgcgcca catagcagaa eecfcaaaage 8460 
aaaaetctca aggatcttac cgctgttgag 8520 
caactgatct tcagcatctt ttactttcac 8S8C 
gcaaa&tgcc gcaaaaaagg paataagggc 8640 
cct ttttcaa tattattgaa gcattcatca 8700 
tgaatgtatt cagaaaaata aacaaatagg 8760 
acctgacgtc taagaaacca ttattatcat 8820 
gaggcccttt cgtctecaaG AATTAATTCT 8880 
ACTCATOTTC GTATTGTGAA ATAGACOCAG 8940 
TTCC 8974 
I 40 I 50 \ 60 
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SNA ••gutnce 10215 b.p. AGATCTAACATC ... ACTTATTATTCC linear 



I 10 t 20 I 30 

1 ACATCTAACA TCCAAAGACG AAACCTTCAA 
61 GTCCATTCTC ACACATAACT CCCAAACCCA 
121 TCCAAACCCA CGACCTCCAC TPCTCTTCTC 
161 AGCCCACTTA TTCCGCTTGA TTCGAGCTCC 
241 ACACCATCAC TTTATIACCC T C T CTA TCCT 
301 TTTCCGAATG CAACAAOCTC OQCATTACAC 
361 AGTGTCGGGT CAAATAGTTT CATGTTOCCC 
421 CTCTTOGAAC CTAATATGAC AAAAGCGTGA 
481 TTGAAATCCT AACOGCCACT TOGTCAAAAA 
541 CTTCTTTCCT ATTGATTGAC CAATCCTCAA 
601 CTCTATCCCT TCTGAACCCG CTGGCAOCTG 
661 TTTTCGATGA TTATCCATTG TcTCCACATT 
721 OCTGATACCC TAACCTTCAT CATCAAAATT 
781 ATATAAACAG AAGCAACCTC CCCTCTCTTA 
841 ACTTTCATAA TTGCCACTGG TTCCAATTCA 
901 CAACTTCAGA AGATCAAAAA ACAACTAATT 
961 TCAATTTTTA CTCCAGTTTT ATTCGGAGCA 
1021 ACAACAGAAC ATGAAACGGC ACAAATTCCG 
1081 GAACCCGATT TCGATGTTCC TCTTTTCCCA 
1141 TTTATAAATA CTACTATTCC CACCATTCCT 
1201 AGACAGGCTC AAGCTTACGT AGAATTCgac 
1261 gttgagactt cccgctctgt at tt caeca g 
1321 accggcatcg ttgatttggg ttcgaaaatc 
1381 ggcctgaaag ccatttggca ggttgagcaa 
1441 tggggcaacc gccaatcctc categgcttg 
IS 01 cgtttgaaca gcgtcctgaa agacacoggc 
1561 tatttgggtg taaacaaaat tgccgaaccc 
1621 tctcccoaat ttgccggcct cagcggcagc 
1681 agacataaca gcgaatctta ccacgccggc 
1741 caatatggcg gtgcctataa aagacatcat 
1801 taccagattc accgtttggt cageggttae 
1861 gcacagcaac aagacgcgaa actgactgat 
1921 gttgeegcta ccttggcata ccgcttcggc 
1981 ggcttcaaag gtttggttga tgatgeagae 
2041 ggtgcggaat acgacttctc caaacgcact 
2101 gaaggcaaag gcgaaaacaa attegtageg 
2161 ttctaaCCTA GGCCCGCCCC CAATTAATTC 
2221 TTGCCCAcTT ACCAGAAGAC CGGTCTTCCT 
2281 ATTTGCCTGA GAGATOCAGG CTTCATTTTT 
2341 TAOGATTTTT TTTGTCATTT TGTTTCTTCT 
2401 CGCAGCTCAT GAATATCTTG TCGTACCGGT 
2461 CTTGGTATTT CCCACTCCTC TTCAGAGTAC 
2521 AGCTTATCG A TAAC CTTT AA TGCGGTACTT 
2581 CACCGTGTAT GAAATCTAAC AATGCOCTCA 
2641 CTGTAGGCAT AGCCTTGCTT ATOCCOCTAC 
2701 CCGACACCAT CGCCAGTCAC TATGGCGTCC 
2761 TATCCGCACC CGTT C T CCG A OCACTCTCCG 
2821 CT1 CO C T ACT TGGAGCCACT ATCGACTACG 
2881 TCTATCGAAT CTAAATCTAA GTTAAAATCT 
2941 ATACCAACCT TAACAOCATT OCGGTGAGCA 
3001 ACTCCTTCCC CAATATCTTT CACTCCCTCA 
3061 TCGAACGTTC CACTGTTAAC TCCCCTGTAT 
3121 TCCTTCCTAC CGGAGGACTA ATCTCCACAA 
3181 CATCCACCCT GTTGTACTTG ATCAACATAA 
3241 TCTTCAGGAC CGTACTGATT GGACATTTCC 
3301 GTTGTAGACT GTGCAATACA CTTGCGTACA 
3361 TTGTGAACAC CATCTTCAAT TCTGGCAACC 
3421 ATCACCTCCG AATCAATACC ATC TTCACCT 
34 81 CGATCAGCCT ATTTATCACC AATAACTAGA 
3 541 CTACACAGCG CTGATCTCTC ATTTTCAACC 
3601 CCTGGACCAA ATA TTTTGT C AC ACTTACG A 
3661 ACTGCCTCCC CCCCTCCTCC TAGCACCATA 
3721 ATGACTTCTC CCGTAAGCCT ACCATCCTTC 
37 81 CAACCACCAA C T TT GGCAGG AACACCCACC 
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TCAAACCTTT TTCCCATCCC ACATCCACAG 60 
ACAGGAGCGG ATACACTACC ACCACACCGT 120 
CTCAACACCC ACTTTTCCCA TOGAAAAACC 180 
CTCATTCCAA TTCCTTCTAT TAQOCTACTA 240 
GOCCCOCCTG OCCAGCtTCA TCTTTCTTTA 300 
CCGAACATCA CTCCACATGA GOGCTTTCTC 360 
AAATGCCCCA AAACTGACAG TTTAAACOCT 420 
TCTCATCCAA GATGAACTAA GTTT CCTTCC 480 
GAAACTTCCA AAACTCCCCA TACOGTTTCT 540 
AAATAATCTC ATTAAT0CTT AGCGCAGTCT 600 
TGOCGAAACG CAAATGGCGA AACaCCCCCT 660 
GTATGCTTCC AACATTCTGG TOGCAATACT 720 
TAACTGTTCT AACCCCTACT TCACAGcAAT 780 
AA CCTTTTTT TTTATCATCA TTATTAGCTT 840 
CAACCTTTTG ATTTTAACGA CTTTTAACGA 900 
ATTCGAAGGA TCCAAAOGat gAGATTTCCT 960 
TCCTCCGCAT TACCTCCTCC AGTCAACACT 1020 
GCTGAACCTC TCATCGGTTA CTCAGATTTA 1080 
TTTTCCAACA CCACAAATAA CGGGTTATTC 1140 
GCTAAAGAAG AAGGGGTATC TCTCCAGAAA 1200 
gtcactttgt aeggtactat taaggctggt 1260 
aacggccaag ttactgaagt tacaaccgct 1320 
ggcttcaaag gecaagaaga cctcggtaac 1380 
aaagcatcta tcgccggtac tgactccggt 1440 
aaaggegget teggtaaatt gcgcgtcggt 1500 
gacatcaatc cttgggatag caaaagegae 1560 
gaggcacgcc teatttcegt acgctacgac 1620 
gtacaatacg egcttaaega caatgeagge 1680 
tccaactaca aaaacggtgg ettcttegtg 1740 
caagtgcaag agggcttgaa tattgagaaa 1800 
gacaatgatg ccctgtacgc ttccgtagcc 1860 
gcttccaatt cgcacaactc ccaaaccgaa 1920 
aaegtaaege cccgagtttc ttacgcccac 1980 
ataggcaacg aatacgacca agtggttgtc 2040 
tctgccctgg tttctgeegg ttggttgcaa 2100 
actgccggcg gtgtcggtct gcgccacaaa 2160 
GCCTTAGACA TGACTCTTCC TCAGTTCAAC 2220 
AGATTCTAAT CAAGAGGATC TCAGAATGCC 2280 
CATA CTTTTT TATTTCTAAC CTATATAGTA 2340 
CCTACGACCT TGCTCCTGAT CACCCTATCT 2400 
TTCGGAAAAT CATTCCAGTT TGATCTTTTT 2460 
AGAAGATTAA GTGAGAACTT CCTTTGTGCA 2520 
TATCACAGTT AAATTCCTAA CCCAGTCAGG 2580 
TCCTCATCCT CGGCACCGTC ACCCTGCATC 2640 
TGCCGOCCCT CTTGCGGGAT ATCCTCCATT 2700 
TCCTAGCCCT ATATCCCTTG ATGCAATTTC 2760 
ACCG CTTTC Q OCGCCGCCCA CTCCTGCTCG 2820 
CGATCATGGC GACCACACCC G TCCTGT GCA 2880 
CTAAATAATT AAATAACTCC CAGTTTCTCC 2940 
TCTAGACCTT CAACACCACC CAGATCCATC 3000 
CCAGTTACCT CTTCTGAAGT GATGAACTTC 3060 
TGACGGCCAT ATCCGTACCT TGGCAAAGTG 3120 
CTCTCTGGAG ACTAGCCACC AACAAACACA 3180 
CAAGAAGCAT TCTCCATTTG GACCATCAAC 3240 
AAACCCTCCT CGTAGGTTGC AACCGATACG 3300 
ATTTCAACCC TTGGCAACTG CACACCTTCG 33 60 
TCCTTCTCTG TCATATCGAC ACCCAACAGA 3420 
TCAGACAGAA GGTCTGAGGC AACGAAATCT 3480 
ACTTCACAAG GCCCAGCAGG CATGTCAATA 3540 
ATCATCTTCC CAGCAGTAAC GAACTCGTTT 3600 
ACA GTTTC TG TTCCCTAAGC C ATAGCA GCT 3660 
CACTTACCAC CAACCTTCTC GCCAACCTAG 3720 
TTACCTCCAC ATCCAAAAAC AATTTCTTTG 37 60 
ATCACGCAAC TCGAACGCAC AATTCCCCTT 3840 
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3*41 CCACCAOOAA TATAOAOOCC JUUSTTCICA 
3901 CCACGCCAAC TCTCAAjCXTO CAACOTCTtt 
3961 TTATCTATAG AX2AGATCAAT OOCTOTCTW 
402- CGGAAACGAC CTTCTAACAC ACOTOTcSc 
4081 TCTAAAACGG CTTTCTCACC JOTTtSoS 

2^ ^^ 7Gn ««S5S5 

f 20 * f?^ rrAGAAA CCTCAATTTT OCACAATTCA 
-_^1 TTGGATTCTT CTTTAOCTTO TTCCTTOgTO 
4321 CTCACCTTTA CGCACTTCAT ATCCAOOTTT 
4381 TTX3CCACATC TAACTAATOC AAAATAAAAT 
4441 TTCGATTTAG CTTC T OC AAO TTCATCAX9CT 
4501 TCCTCGTCAA ATAACCGTTT CCTATAAOAA 
4561 AACCTCGCTT CCATCGCTCT AAGACCCTTT 
4621 TCACAGAAAC CAACACCTCT TTGTTCAACC 
4681 AATTCGATAC CCAGCAACTT TTOAGTTCCT 
4741 TCACCACOAG ATTGGTAGAC TOCAOTTTCT 
4801 CACGAGTACA CCAGCCCCAA OGAGTAATTA 
4861 CC ATCCGG CC GCTCAGTAGT CAAAGAOGCC 
4921 ACATCTTCAG AAAGTTCGTA TTCAGTA6TC 
4981 CCACA ACCAA CAGTGCAAGT CACATCTACC 
5041 CTTCTACTAC CCCCATTACT GAAACTCTTC 
5101 CCCATACTAC CATTAGOOCG CAACGATOCA 
5161 CTAGCGCCTC GGATCATOCT TTGCACAACT 
5221 TCATTCATAC CATTATTGTA CAACTTCAGC 
S281 TCTGTAACGC ATGACTCAAC TTGCACATTA 
5341 ATCACCTTGT CCACCTCCTC ACCAGCATAG 
5401 GAATTATCAA ACTCTGCAAC ACTTCCGTAT 
5461 CTCCGACAGT GAGTCTAGTC TTGAGAAATT 
5521 ACTCATCACG ACATCCTCTA CGCCGGACCC 
5581 CCCTGACCTC TCCCTCCTGA AGAAGGTCTT 
5641 TCATCCAGCC AGAAACTCAG OGACCCACCG 
5701 TTGCTCATTT TCAACTTTTG CTTTGCCACC 
5761 ATCTCATCCT TCAACTCAGC AAAAGTTOGA 
5821 TCACCGTAAT CCTCTGCCAG TCTTACAACC 
5881 CGACCATCAA ATCAAACTGC AATTTATTCA 
5941 AAAGCCCTTT CTCTAATCAA GGAGAAAACT 
6001 CCTGGTATCC CTCTCCCATT CCGACTCCTC 
6061 CGTCAAAAAT AACCTTATCA ACTGAGAAAT 
: 6121 ATCGCAAAAG CTTATGCATT TCTTTCCAGA 
6181 CATCAAAATC ACTCCCATCA ACCAAACCCT 
6241 GAAATACCCG ATCCCTOTTA AAAGGACAAT 
€301 CCAACACTGC CACCCCATCA ACAATATTTT 
6361 GGAATCCTCT TTTCCCGCGG ATCGCAGTGG 
6421 TAAAATCCTT GATCCTCCCA AGAGGCATAA 
6481 CATCTGTAAC ATCATTGGCA ACGCTAOCTT 
6541 CCCCCTTCCC ATACAATCCA TACATTGTCG 
6601 iZH ATACC ° ATATAA ATCA CCATCCATCT 
6661 TT TCCCCT TG AATATGGCTC ATAACACCCC 
6721 TTATTCTTCA TGATCATATA TTTTTATCTT 
6781 CAACCTGCCT TTCCCCCCCC CCCCTGCACG 
6841 CTCGCGCCTA TATCGCCCAC ATCACCGATG 
6901 TGAGCCCTTC TTTCCCCGTG GGTATCGTGG 
6961 CCATCTCCTT GCATGCACCA TTCCTTOCCG 
7021 TCCGCTCCTT CCTAATGCAG GACTCGCATA 
7081 GTATGGGAAT CGTGATACCC OCATTCTTCA 
7141 CAACTAAAGC AACCGGAGGA CGAGATTTCA 
7201 TAGACTCCAA CTCTGAGACT ATC TCOGT T A 
7261 TAAATG AAGT CCCACCAATA AAGAAATCCT 
7321 CTTTTTCCGT CCCTTCAACT ATAAAATCTA 
73 Bl GCAAATGCTT ACCTTCTGGA CCTTCAAGAG 
7441 CTTCACTGAC AACCTTGCTA TTTCCTTCAA 
7501 TTTCTCTACT ATTCATCCAA CCCAGTCCCC 
7561 TTCACCTCAT CTTTCTATCA ATAAATCTAC 
7621 CCCCATAAAT ACCCAAATCT AAAACTCTTT 
7681 TCTAT TAAAC CCCAAATCAC CTCGTAGTCT 
77 41 CTTTTACAGA AATTTGCCGA GATGCGATAT 
7801 AAATTTATCT CAAGATCTCT CCCTCCCGCC 
7861 CATGCAGCTC CCCGACACCC TCACACCTTG 
7921 CCCTCAGCCe GCCTCAGCCG GTGTTGGCGG 
7981 TAGCGATAC C CGACTGTATA CTCGCTTAAC 
8041 CTCCACCATA TCCCCTCTGA AATACCGCA C 
8101 CCCTCTTCCC CTTCCTCCCT CACTCACTCC 
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ATAOCTCTTO CAAAAOCAGA CCAGA CTACA 3900 
OTTACTTOAO CTTCATCGAA TTTOCTOACC 3960 
ftOCTTATCTG CCAATTCCAT AACTTCCTCT 4020 
AAAOCGACTC CATCAAACTT COCACTTAGT 4080 
ACATTCTCCA CAATTCGTTT CACTAATTCC 4140 
OGAACGGCAT CTTCAATTTC TTGTGAGGAG 4200 
ATACCACCTT CACAAGCCAC TTCTTTACCT 4260 
TATCCTOOCT TCGCATCTCC TTTCCTTCTA 4320 
CTCTCCACCT CGTCCAACCT CACACCGTAC 4380 
AACTCACCAC ATTCCCAGGC TATATCTTCC 4440 
TCCTCOCTAA TTTTAGOCTT CAACAAAACT 4500 
OCTTCT00AG CATTCCTCTT ACGATCCCAC 4560 
GATTCCOCAA AACAGGAAGT CCCTTCeAAG 4620 
ACAAATTTCA AGCAGTCTCC ATCACAATCC 4680 
OCACATCTAC CACCTTTATA CCACAAACCG 4740 
GTCCTTATAG CCTCCGCAAT AGACTTTTTG 4800 
CAAGAGTCAG OCACCAAAGT AGTGAATAGA 4860 
AACAAAATTT CACTGACACG CAACTTTTTC 4920 
AATTGOCGAC CATCAATAAT GGGGATTATA 4980 
AACTTTGCCG TCTCAGAAAA AGCATAAACA 5040 
A AATCC CCCA GTCGAGAAGA AAAAGGCACA 5100 
ACTTTAT CAA CCACGCTCCT ATACATAACC 5160 
CTTTCTCCCA AATCTAGCTC CAAAATCACT 5220 
AACTTGTCGA TCACCTCCTC AAATTCCTCC 5280 
ACTTGAAGCT CACTCCATTG AGTGAACTTG 5340 
GGAAACACCG CTTTTCCTAC CAAACTCAAG 5400 
CCACGTAGCA AGGCAAATGT CATACTTGAA 5460 
CTGAAGCCCT ATTTTTATTA TCAGTGACTC S520 
ATCGTGGCCC ACCTCCACCg QQQQQQQQgG 5580 
CCTGACTCAT ACCAGGCCTG AATCCCCCCA 5640 
TTCATCAGAC CTT T CTPCT A ^CCTGCACCAG 5700 
CAACCCTCTG CCTTCTCCCG AACATCCGTC 5760 
TTTATTCAAC AAAGCCCCCC TCCCCTCAAG 5820 
AATTAACCAA TTCTGATTAG AAAAACTCAT 5880 
TATCAGGATT ATCAATACCA TATTTTTCAA 5940 
CACCGAGGCA GTTCCATACG ATCGCAAGAT 6000 
CAACATCAAT ACAA CCTATT AATTTCCCCT 6060 
CACCATGACT CACCACTCAA TCCCCTGAGA 6120 
CTTCTTCAAC AGGCCAOCCA TTACGCTCCT 6180 
TATTCATTCG TCATTGCCCC TGACCCACAC 6240 
TACAAACAGC AATCGAATGC AACCCGCGCA 6300 
CACCTGAATC AGCATATTCT TCTAATACCT 6360 
TCACTAACCA TGCATCATCA GGAGTACCGA 6420 
ATTCCGTCAC CCACTTTAGT CTCACCATCT 6480 
TOCCATCTTT CAGAAACAAC TCTGCCGCAT 6540 
CACCTGATTC CCCGACATTA TCCCCACCCC 6600 
TCGA ATTTAA TCCCCGCCTC GACCAAGACG 6660 
TTGTATTACT GTTTATGTAA GCAGACAGTT 6720 
CTGCAATGTA ACATCAGAGA TTTTGAGACA 6780 
TCCCCATCAC CGCCGCCACA GGTCCCCTTC 6840 
GGGAAGATCG GGCTCGOCAC TTCGCCCTCA 6900 
CAGGCCCCCT CGCCGGGGGA CTCTTCGCCG 6960 
CGCCCGTCCT CAACCCCCTC AACCTACTAC 7020 
AGGGAGAGCG TCGACTATCT ATGATTGCAA 7080 
CTCTCTTGAC CTCTCCTATC AGATTATCCC 7140 
TGCTAAATTT CTCTCACTTT TOGTCATCAG 7200 
TGAC ACCACA AATCTCCTTC TTCGACACAC 7260 
TCTTATCAGG AACAAACTTC TT C T TTCC AA 7320 
GAGTGCATAT GTCGGGTAGG AATGCAGCGG 7380 
GTATGTAGGC TTTCTAGATA CTGATGCCAA 7440 
ACCA TTCCGA ATCCACAGAA ATCAAAGTTG 7500 
TCTTGAAACT CACAATAGTG TGCTCGTCTT 7560 
TCTTTCATCT AAATAATCTT GACGAGCCAA 7620 
TAAAACCTTA AAACCACAAG TATGTCTGCC 7680 
CATCCTCATC AACTTGACCC CCACTATCTT 7740 
CGAGAAAAAC CTACCCTCAT TTTAAACGTC 7800 
TTTCCCTCAT GACCCTGAAA ACCTCTCACA 7860 
TCTGTAAGCG CATGCCCCGA GCAGACAAGC 7920 
GTGTCGCCGC GC A CCCATC A CCCACTCACC 7980 
TATGCGGCAT CAGACCAGAT TGTACTGAGA 8040 
AGATCCCTAA CGAGAAAATA CCCCATCAGC 8100 
CTGCCCTCCG TCCTTCCGCT GCCCCGAGCC 8160 
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8X61 CTATCACCTC actcaaagoc cctaataccc 
8221 AAQAACATGT CAOGAAAAGG CCAOCAAAAC 
8281 OCOTXTTTCC ATAGOCTOCO OCCCCCTGAC 
8341 ACCTGGCGAA ACCCGACAGG ACTATAAAGA 
8401 CTOCOCTCTC CTGTTOCOAC OCTOOOOCTT 
8461 OGAAOCGTGG O OCTTICIC A ATOCTCAOOC 
8521 COCTCCAAGC TGGGCTCTGT OCACGAACCC 
8S81 CCTAACTATC CTCTTCACTC CAACCCOCTA 
8641 ACTCCTAACA CGATTAOCAC AOCGAGGTAT 
8701 TGGCCTAACT ACCCCTACAC TAGAACOACA 
8761 CTTACCTTCC GAAAAAGAGT TCGTAOCTCT 
8821 OGTO GTTTTT TTGTTTCCAA OCAGCAGATT 
8881 CCTTTCATCT TTTCTAOGGO CTCTGAOCCT 
8941 TTCGTCATGA GATTATCAAA AAGGATCTTC 
9001 TTTAAATCAA TCTAAAGTAT ATATGAGTAA 
9061 ACTGAGGCAC CTATCTCAGC GATCTGTCTA 
9121 CTCCTCTAGA TAACTACGAT ACGGGAOGCC 
9181 OCGCGAGACC CACOCTCACC GGCTCCAGAT 
S241 OCOGAGCOCA GAACTOCTCC TGCAACTTTA 
9301 CGGGAAGCTA GAGTAAGTAG TTCOCCAOTT 
9361 OCAGOCATCG TGGTCTCACC CTCCTCQTTT 
9421 OGATCAAGGC GACTTACATG ATCCCOCATG 
9481 CCTCCGATCG TTCTCAGAAG TAACTTOGCC 
9S41 CTGCATAATT CTCTTACTCT CATGCCATCC 
9601 TCAACCAACT CATTCTGAGA ATA GTGTATC 
9661 ACACGGGATA ATACCOCGCC ACATAGCAGA 
9721 TCTTCCGCGC GAAAACTCTC AACGATCTTA 
9781 ACTCCTGCAC CCAACTGATC TTCAGCATCT 
9841 AAAACACCAA CCCAAAATGC CGCAAAAAAG 
9901 CTCATACTCT TC CTTTTT CA ATATTATTGA 
9961 CGATACATAT TTGAATGTAT TTAGAAAAAT 
10021 CCAAAAGTGC CACCTGACGT CTAAGAAACC 
10081 AGCCGTATCA CGAGGCCCTT TCCTCTTCAA 
10141 TCGATAACCT CACTCATCTT GCTATTGTGA 
10201 AACACTTATT ATTCG 
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TTATCCACAG AATCAGOGCA TAAOOCAOCA 8220 
OCCAGGAACC CTAAAAACGC CQOCTTQCTG 8280 
GAOCATCACA AAAATCGACC CTCAAGTCAG 8340 
TAOCAOGCCT TTCOOCCTGC AACCTCOCTC 8400 
AOC3QGATACC TCTCCC3CCTT TXriXX'CTTLS G 8460 
TGTAOGTATC TCACTTCGCT GTAGCTCCTT 852C 
OCCGTTCAGC OCCACCOCTG COCCTTATCC 8560 
AGACACCACT TATCGCCACT GOCAGCAOCC 8640 
GTAGGOGGTG CTACAGAGTT CTTGAACTGC 8700 
GTATTTCCTA TCTCCGCTCT GCTGAAGOGA 8760 
TGATCOGGCA AACAAACCAC COCTCGTAGC 8620 
ACGCGGAGAA AAAAAGGATC TCAAGAAGAT 8880 
CAGTGGAACG AAAACTGACG TTAAGGGATT 8940 
ACCTAGATCC TTTTAAATTA AAAATGAAGT 9000 
ACTTGGTCTC ACAGTTACCA ATGCTTAATC 9060 
TTTCGTTC AT CCATAGTTGC CTGACTCCCC 9120 
TTACCATCTG OCCXXAGTGC TGCAATGATA 9180 
TTATCAOCAA TAAACCAGCC AGOCGGAAGG 9240 
T CCOCCTCCA TCCACTCTAT TAA TTOTTC C 9300 
AATACTTTGC GCAACGTTGT TOCCATTOCT 9360 
GGTATGGCTT CATTCAOCTC C OGT T C CCAA 9420 
TTGTGCAAAA AAOCGGTTAG C T CC TT COCT 9480 
OCAGTCTTAT CACTCATCGT TATGCCAOCA 9540 
GTAAGATGCT TTTCTGTGAC TGGTGAGTAC 9600 
CGGCGACCGA GTTGCTCTTC CCCGGCGTCA 9660 
ACTTTAAAAG TCCTCATCAT TGGAAAACGT 9720 
OCGCTCTTGA GATCCAGTTC CATCTAACCC 9780 
TTTACTTTCA CCACCGTTTC TGCGTGAGCA 9840 
GGAATAAGCG CCACACCGAA ATGTTGAATA 
AGCATTTATC ACCGTTATTC TCTCATGAGC 
AAACAAATAG OGCTTCCGCG CACATTTCCC 
ATTATTATCA TGACATTAAC CTATAAAAAT 
GAATTAATTC TCATCTTTGA CACCTTATCA 
AATAGACGCA GATCCGGAAC ACTCAAAAAT 
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